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PREPARATION OF FUNCTIONAL HUMAN UROKINASE PROTEINS 
10 Reference to Related Application 

This is a continuation-in-part of application USSN 06/358773, 
filed April 15, 1982. 

15 Field of the Invention 

The present invention relates to human urokinase, to novel forms 
and compositions thereof and particularly to means and methods for 
the preparation in vitro of functional protein species of human 

2o urokinase. 

The present invention is based in part on the discovery of the 
DNA sequence and deduced amino acid sequence of native urokinase as 
well as associated portions of the urokinase molecule found to be 
the functional bioactive moieties. This discovery enabled the 

25 production of urokinase in various forms via the application of 
recombinant DNA technology, in turn enabling the production of 
sufficient quality and quantity of materials with which to conduct 
requisite biological testing identifying the biologically 
functional, hence useful moieties of the molecule. Having 

30 determined such, it was possible to tailor-make functional species 
of urokinase via genetic manipulation and in vitro processing, 
arriving efficiently at hitherto unobtainable commercially 
efficacious amounts of active products. This invention is directed 
to these associated embodiments in all respects* 

35 
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The publications and other materials hereof used to illuminate 
the background of the invention, and in particular cases, to provide 
additional details concerning its practice are incorporated herein 
by reference* and for convenience, are numerically referenced in the 
following text and respectively grouped in the appended bibliography. 

Background of the Invention 
A. Human Urokinase 

The fibrinolytic system is in a dynamic equilibrium with the 
coagulation system, maintaining an intact, patent vascular bed. The 
coagulation system deposits fibrin as a matrix serving to restore a 
hemostatic condition. The fibrinolytic system removes the fibrin 
network after the hemostatic condition is achieved. The 
fibrinolytic process is brought about by the proteolytic enzyme 
plasmin that is generated from a plasma protein precursor 
plasminogen* Plasminogen is converted to plasmin through activation 
by an activator* 

Urokinase is one such activator* It and another activator, 
streptokinase, are currently commercially available* Both are 
indicated for the treatment of acute vascular diseases such as 
myocardial infarct, stroke, pulmonary embolism, deep vein 
thrombosis, peripheral arterial occlusion and other venous 
thromboses* Collectively, these diseases account for major health 
hazards and risks- 

The underlying etiological basis for these diseases points to 
either a partial, or in severe cases, total occlusion of a blood 
vessel by a blood clot — thrombus or thromboembolus* Traditional 
anticoagulant therapy, as with heparin and coumarin, does nothing to 
directly enhance dissolution of thrombi or thromboembol i . 
Streptokinase and urokinase have enjoyed practical and effective use 
as thrombolytic agents. Until now, however, each has suffered from 
severe limitations. Neither has demonstrated a high affinity for 
fibrin; consequently, both activate circulating and fibrin-bound 
plasminogen relatively indiscriminately. The plasmin formed in 
circulating blood is neutralized rather quickly and lost for useful 
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thrombolysis . Residual plastnin will degrade several clotting factor 
proteins, for example, fibrinogen, Factor V and Factor VIII, causing 
a hemorrhagic potential* In addition, streptokinase is strongly 
antigenic and patients with high antibody titers respond 
inefficiently to treatment and cannot remain on continuous 
treatment. Urokinase therapy is expensive, owing to its involved 
isolation from human urine or tissue culture, and it therefore is 
not generally accepted in clinical practice. Urokinase has been the 
subject of numerous investigations — See, for example, references 
1-9* Presently available urokinase, as defined, is isolated from 
human urine or tissue culture, e.g. kidney cells (9A,9B). 

The urokinase molecule exists in several biologically active 
forms ~ high molecular weight (ca. 54000 daltons) and low molecular 
weight (ca. 33000 daltons), each composed of single chain or two 
chain material* The low molecular weight form is derived from the 
high molecular weight form by enzymatic cleavage- Biologically 
active material contains the so-called serine protease portion 
linked, in active form, to a second chain via a disulfide bond. Any 
activity ascribed to the high molecular weight material is believed 
to be due to the similar presence of these two connected chains, the 
strategic disulfide bond and interruption in the sequence doubtless 
being located in the serine protease portion of the overall 
molecules (See Figure A). In any event, until the present 
invention, the identity, and hence function, 0 f the ca, 21000 dalton 
residue was unknown and the assignment of activity to one or another 
of the known moieties of urokinase was not uncontrovertedly possible* 

Recently, there was a report of another form of urokinase 
peptide having low, but specific activity (10, 10A). It was 
speculated that this material corresponds to native urokinase, a 
preform of the previously isolated active species described above, 
most probably consisting of a single chain. 

Previous attempts to clone the requisite gene for urokinase with 
attendant hopes of attaining expression in a microbial host were not 
believed successful (11, 11A). See also (6). 
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It was perceived that the application of recombinant DNA and 
associated technologies* after all, would be a most effective way of 
providing the requisite large quantities of high quality, bioactive 
human urokinase, essentially free of other human protein, and 
derivatives thereof that retain functional bioactivity, thus 
admitting of the use of such materials clinically in the treatment 
of various vascular conditions or diseases* 

B. Recombinant DNA/Protein Biochemistry Technology 
Recombinant ONA technology has reached the age of some 
sophistication. Molecular biologists are able to recombine various 
DNA sequences with some facility, creating new DNA entities capable 
of producing copious amounts of exogenous protein product in 
transformed microbes and cell cultures. The general means and 
methods are in hand for the in vitro ligation of various blunt ended 
or "sticky" ended fragments of DNA, producing potent expression 
vehicles useful in transforming particular organisms, thus directing 
their efficient synthesis of desired exogenous product. However, on 
an individual product basis, the pathway remains somewhat tortuous 
and the science has not advanced to a stage where regular 
predictions of success can be made. Indeed, those who portend 
successful results without the underlying experimental basis, do so 
with considerable risk of inoperability. 

DNA recombination of the essential elements, i.e*» an origin of 
replication, one or more phenotypic selection characteristics, an 
expression promoter, heterologous gene insert and remainder vector, 
generally is performed outside the host cell. The resulting 
recombinant replicable expression vehicle, or plasmid, is introduced 
into cells by transformation and large quantities of the recombinant 
vehicle obtained by growing the transformant. Where the gene is 
properly inserted with reference to portions which govern the 
transcription and translation of the encoded DNA message, the 
resulting expression vehicle is useful to actually produce the 
polypeptide sequence for which the inserted gene codes, a process 
referred to as expression. The resulting product may be obtained by 
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lysing, if necessary, the host cell, in microbial systems, and 
recovering the product by appropriate purification from other 
proteins. 

In practice, the use of recombinant ONA technology can express 
5 entirely heterologous polypeptides — so-called direct expression — or 
alternatively may express a heterologous polypeptide fused to a 
portion of the amino acid sequence of a homologous polypeptide. In 
the latter cases* the intended bioactive product is sometimes 
rendered bioinactive within the fused, homologous/ heterologous 

10 polypeptide until it is cleaved in an extracellular environment. 
See references (12) and (13) ♦ 

Similarly, the art of cell or tissue cultures for studying 
genetics and cell physiology is well established* Means and methods 
are in hand for maintaining permanent cell lines, prepared by 

15 successive serial transfers from isolate normal cells. For use in 
research* such cell lines are maintained on a solid support in 
liquid medium, or by growth in suspension containing support 
nutriments. Scale-up for large preparations seems to pose only 
mechanical problems* For further background, attention is directed 

20 to references (14) and (15) . 

Likewise, protein biochemistry is a useful, indeed necessary, 
adjunct in biotechnology. Cells producing the desired protein also 
produce hundreds of other proteins, endogenous products of the 
cell's metabolism. These contaminating proteins, as well as other 

25 compounds, if not removed from the desired protein, would prove 
toxic if administered to an animal or human in the course of 
therapeutic treatment with desired protein. Hence* the techniques 
of protein biochemistry come to bear, allowing the design of 
separation procedures suitable for the particular system under 

30 consideration and providing a homogeneous product safe for intended 
use. Protein biochemistry also proves the identity of the desired 
product characterizing it and ensuring that the cells have produced 
it faithfully with no alterations or mutations. This branch of 
science is also involved in the design of bioassays, stability 

35 studies and other procedures necessary to apply before successful 
clinical studies and marketing can take place. 
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Summary of the Invention 

The present invention is based upon the discovery that recombinant 
DNA/protein biochemistry technology can be used to successfully 
produce human urokinase in the form of biologically functional , 
5 tailored species* This invention provides active urokinase protein 
suitable for use, in all of its forms, in the prophylactic or 
therapeutic treatment of human beings for various vascular 
conditions or diseases* Each of its forms includes the bioactive 
moiety, to wit, the enzymatic portion of native material believed to 

10 reside in a 2-chain region comprising the serine protease portion. 
In accordance with this invention, a series of urokinase active 
products can be prepared,, either directly in bioactive form or 
notably in a form available for in vitro processing to result in 
bioactive product. This invention also provides the means and 

15 methods for producing full length native urokinase molecules 
particularly in bioactive or bioactivatable form, having the 
potential added advantage of specific affinity for fibrin not 
demonstrated until now with any urokinase product isolated from 
natural sources. Thus provided is human urokinase product having 

20 the potential new property of specific activity toward tangible, 
extant thrombi. The products being produced by cell culture 
harboring recombinant DNA encoding respective product entity, the 
facilities are now at hand to produce human urokinase in a. much more 
efficient manner than has been possible and in forms exhibiting 

25 enhanced biologically significant properties. In addition, 

depending upon the host cell, the urokinase activator hereof may 
contain associated glycosylation to a greater or lesser extent 
compared with native material. 

The present invention comprises the human urokinase products 

30 thus produced and the means and methods of production. The present 
invention is further directed to replicable DMA expression vehicles 
harboring gene sequences encoding the enzymatic portion of human 
urokinase in expressible form. Further, the present invention is 
directed to microorganism strains or cell cultures transformed with 

00 the expression vehicles described above and to microbial or cell 
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cultures of such transformed strains or cultures, capable of 
directing production of the human urokinase products hereof. In 
still further aspects, the present invention is directed to various 
processes useful for preparing said urokinase gene sequences, DMA 
5 expression vehicles, microorganism strains and cell cultures and to 
specific embodiments thereof. Still further, this invention is 
directed to the preparation of fermentation cultures of said 
microorganisms and cell cultures. 

Reference herein to the expression "human urokinase" connotes 

10 polypeptide in bioactive form, produced by microbial or cell culture 
or optional in vitro processing and comprising the enzymatic portion 
corresponding to native material. Human urokinase* according to the 
present invention, is thus provided 1) in full length, in 
contradistinction to material hitherto isolated from natural sources 

15 or 2) in other* bioactive forms bearing the sites of the enzymatic 
portion found essential for plasminogen activation or 3) having a 
methionine first amino acid or a signal polypeptide or conjugated 
polypeptide other than the signal polypeptide fused at the 
N-terminus of the enzymatic portion, the methionine, signal or 

20 conjugate polypeptide being specifically cleavable in an intra- or 
extracellular environment (See reference 12) * In any event, the 
thus produced human urokinase polypeptides are recovered and 
purified to levels fitting them for use in the treatment of various 
cardiovascular conditions or diseases. 

25 

Description of Preferred Embodiments 
A* Microorganisms/ Cell Cultures 

1. Bacterial Strains/ Promoters 

The work described herein was performed employing, inter 
30 alia , the microorganism E. coli K-12 strain GM 48 (thr~, leu", 

B x ~\ lacYl, gal K12, gal T22, ara 14, ton A31, tsx 78, Sup E44, 
dam 3, dcm 6) deposited with the American Type Culture Collection on 
April 9, 1982 (ATCC No. 39099) and E. coli K-12 strain 294 (end A, 
thi~, hsr~, k hsm + ), described in reference (16), deposited 
35 with the American Type Culture Collection, ATCC Accession No. 31446, 
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on October 28, 1978. However, various other microbial strains are 
useful, including known JE. coli strains such as E. coli B, E» coli X 
1776 (ATCC No. 31537, deposited July 3, 1979) and E. coli W 3110 
(F~, x~, protrophic) (ATCC No. 27325 deposited February 10, 

5 1972), or other microbial strains many of which are deposited and 
(potentially) available from recognized microorganism depository 
institutions, such as the American Type Culture Collection 
(ATCC) — cf * the ATCC catalogue listing. See (17). These other 
microorganisms include, for example, Bacilli such as Bacillus 

10 subtilis and other enterobacteri aceae among which can be mentioned 
as examples Salmonella typhimurium , Serratia marcesans , and 
Pseudomonas . utilizing plasmids that can replicate and express 
heterologous gene sequences therein. 

As examples, the beta lactamase and lactose promoter 

15 systems have been advantageously used to initiate and sustain 

microbial production of heterologous polypeptides. Details relating 
to the make-up and construction of these promoter systems can be had 
by reference to (18) and (19). More recently, a system based upon 
the tryptophan pathway, the so-called trp promoter system, has been 

20 developed. Details relating to the make-up and construction of this 
system can be had by reference to (20) and (21). Numerous other 
microbial promoters have been discovered and utilized and details 
concerning their nucleotide sequences, enabling a skilled worker to 
ligate them functionally within pi asmid vectors, have been 

25 published. — See (22). 

2. Yeast Strains/ Yeast Promoters 

The expression system hereof may also employ a pi asmid 
which is capable of selection and replication in either or both 
E m coli and/or yeast, Saccharomyces cerevisiae . For selection in 

30 yeast the pi asmid may contain the TRPl gene (23, 24, 25) which 

complements (allows for growth in the absence of tryptophan) yeast 
containing mutations in this gene found on chromosome IV of yeast 
(26). A useful strain is strain RH218 (27) deposited at the 
American Type Culture Collection without restriction on December 8, 

35 1980. (ATCC No. 44076). However, it will be understood that any 
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Saccharomyces cerevisiae strain containing a mutation which makes 
the cell trpl should be an effective environment for expression of 
the plasmid containing the expression system, e-g*, strain pep4~l 

(28) . This tryptophan auxotroph strain also has a point mutation in 
TRP1 gene* 

When placed on the 5' side of a non-yeast gene the 
S'-flanking DNA sequence (promoter) from a yeast gene (for alcohol 
dehydrogenase 1) can promote the expression of a foreign gene in 
yeast when placed in a plasmid used to transform yeast. Besides a 
promoter, proper expression of a non-yeast gene in yeast requires a 
second yeast sequence placed at the 3'-end of the non-yeast gene on 
the plasmid so as to allow for proper transcription termination and 
polyadenylation in yeast* In the preferred embodiments, the 
5 1 -flanking sequence of the yeast 3-phosphoglycerate kinase gene 

(29) is placed upstream from the structural gene followed again by 
DNA containing termination - polyadenylation signals, for example, 
the TRP1 (23-25) gene or the PGK (29) gene. 

Because yeast 5 '-f Tanking sequence (in conjunction with 3" 
yeast termination DNA) (infra) can function to promote expression of 
foreign genes in yeast, it seems likely that the 5 '-flanking 
sequences of any yeast gene could be used for the expression of 
important gene products, e.g., glycolytic genes such as e.g., 
enolase, glyceraldehyde - 3-phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, glucose - 6~phosphate 
isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phosphoglucose isomerase, and 
glucokinase. Any of the 3*~flanking sequences of these genes could 
also be used for proper termination and mRNA polyadenylation in such 
an expression system. 

Finally, many yeast promoters also contain transcriptional 
control so they may be turned off or on by variation in growth 
conditions. Some examples of such yeast promoters are the genes 
that produce the following proteins: AlcohoT dehydrogenase II, acid 
phosphatase, degradative enzymes associated with nitrogen 
metabolism, glyceraldehyde -3-phosphate dehydrogenase, and enzymes 
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responsible for maltose and galactose utilization (30). Such a 
control region would be very useful in controlling expression of 
protein product - especially when their production is toxic to 
yeast. It should also be possible to combine the control region of 
one S'-flanking sequence with a 5'-flanking sequence containing a 
promoter from a highly expressed gene. This would result in a 
hybrid promoter and should be possible since the control region and 
the promoter appear to be physically distinct DNA sequences* 
3, Cell Culture Systems/ Cell Culture Vectors 
Propogation of vertebrate cells in culture (tissue culture) 
has become a routine procedure in recent years. (See 31). A useful 
host for the production of heterologous protein is the COS-7 line of 
monkey kidney fibroblasts (32). However, the present invention 
could be practiced in any cell line that is capable of the 
replication and expression of a compatible vector, e.g., WI38, BHK, 
3T3, CHO, VERO, and- HeLa cell lines. Additionally, what is required, 
of the expression vector is an origin of replication and a promoter 
located in front of the gene to be expressed, along with any 
necessary ribosome binding sites* RNA splice sites, polyadenylation 
site, and transcriptional terminator sequences. It will be 
understood that this invention, although described herein in terms 
of a preferred embodiment, should not be construed as limited to 
these sequences. For example, the origin of replication of other 
viral (e.g., Polyoma, Adeno, VSV, BPV, and so forth) vectors could 
be used, as well as cellular origins of DNA replication which could 
function in a nonintegrated state. 

In' such vertebrate cell hosts, the genetic expression 
vector for a urokinase product polypeptide hereof may also contain a 
secondary genetic coding sequence under the control of the same 
promoter. The secondary sequence provides for a convenient 
screening marker, both for transformants in general, and for 
transformants showing high expression levels for the primary 
sequence, as well as serving as a control device whereby the 
expression of the desired urokinase polypeptide can be regulated, 
most frequently enhanced. 
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This is particularly significant as the two proteins are 
produced separately in mature form. While both DNA coding sequences 
are controlled by the same transcriptional promoter, so that a fused 
message (mRNA) is formed, they are separated by a translational stop 
5 signal for the first and start, signal for the second, so that two 
independent proteins result. 

It has been recognized that environmental conditions are 
often effective in controlling the quantity of particular enzymes 
that are produced by cells under certain growth conditions. In a 

10 preferred embodiment, advantage is taken of the sensitivity of 
certain cells to methotrexate (MIX) which is an inhibitor of 
dihydrofolate reductase (DHFR) • DHFR is an enzyme which is 
required, indirectly, in synthesis reactions involving the transfer 
of one carbon units. Lack of DHFR activity results in inability of 

15 cells to grow except in the presence of those compounds which 

otherwise require transfer of one carbon units for their synthesis* 
Cells lacking DHFR, however, will grow in the presence of a 
combination of glycine, thymidine and hypoxantbine. 

Cells which normally produce DHFR- are known to be inhibited 

20 by methotrexate. Most of the time, addition of appropriate amounts 
of methotrexate to normal cells will result in the death of the 
cells. However, certain cells appear to survive the methotrexate 
treatment by making increased amounts of DHFR, thus exceeding the 
capacity of the methotrexate to inhibit this enzyme* It has been 

25 shown previously that in such cells, there is an increased amount of 
messenger RNA coding for the DHFR sequence. This is explained by 
assuming an increase in the amount of DNA in the genetic material 
coding for this messenger RNA* In effect, apparently the addition, 
of methotrexate causes gene amplification of the DHFR gene* Genetic 

30 sequences which are physically connected with the DHFR sequence 
although not regulated by the same promoter are also amplified. 
Consequently, it is possible to use the amplification of the DHFR 
gene resulting from methotrexate treatment to amplify concomitantly 
the gene for another protein, in this case, the desired urokinase 

35 polypeptide. 
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Moreover, if the host cells into which the secondary 
sequence for DHFR is introduced are themselves DHFR deficient* DHFR 
also serves as a convenient marker for selection of cells 
successfully transfected. If the DHFR sequence is effectively 
5 connected to the sequence for the desired peptide, this ability 
serves as a marker for successful transfection with the desired 
sequence as well* 

B. Vector Systems 
10 1.- Expression in Bacterial Systems 

Expression plasmids for bacterial use* e.g., E» coli 
are commonly derived using BR322 (37) as vector and appropriately 
inserting the heterologous gene sequence together with trans! ational 
start and stop signals in operable reading phase with a functional 
15 promoter, taking advantage of common or synthetically created 

restriction sites* The vector will carry one or more phenotypic 
selection characteristic genes and an origin of replication to 
insure amplification within the host. Again, the heterologous 
insert can be aligned so as to be expressed together with a fused 
20 presequence, derivable for example from the trp system genes. 
2. Expression in Yeiast 

To express a heterologous gene such as the cDNA for human 
urokinase in yeast, it is necessary to construct a plasmid vector 
containing four components. One component is the part which allows 

25 for transformation of both _E. coli and yeast and thus must contain a 
selectable gene from each organism. Thi.s can be the gene for 
ampicillin resistance f rom E. col i and the gene TRP1 from yeast,) 
This component also requires an origin of replication from both 
organisms to be maintained as a plasmid DNA in both organisms. This 

30 can be the E. coli origin from pBR322 and the arsl origin from 
chromosome III of yeast.) 

A second component of the plasmid is a S'-flanking sequence 
from a yeast gene to promote transcription of a downstream-placed 
structural gene. The 5'-f Tanking sequence can be that from the 

35 yeast 3~phospho- glycerate kinase (PGK) gene. The fragment is 

0320L 



0092182 

constructed in such a way so as to remove the AT 6 of the PGK 
structural sequence, replaced with a sequence containing alternative 
restriction sites* such as Xbal and EcoRI restriction sites, for 
convenient attachment of this 5' -flanking sequence to the structural 
5 gene. 

A third component of the system is a structural gene 
constructed in such a manner that it contains both an ATG 
translational start and translational stop signals- 

A fourth component is a yeast DNA sequence containing the 

10 3' -flanking sequence of a yeast gene, which contains the proper 
signals for transcription termination and polyadenylation. 
3. Expression in Mammalian Cell Culture 
The strategy for the synthesis of heterologous peptide in 
mammalian cell culture relies on the development of a vector capable 

15 of both autonomous replication and expression of a foreign gene 
under the control of a heterologous transcriptional unit. The 
replication of this vector in tissue culture is accomplished by 
providing a DNA replication origin (such as from SV40 virus) > and 
providing helper function (T antigen) by the introduction of the 

20 vector into a cell line endogenously expressing this antigen (33, 
34) . The late promoter of SV40 virus precedes the structural gene 
and ensures the transcription of the gene, 

A useful vector to obtain expression consists of pBR322 
sequences which provides a selectable marker for selection in 

25 JL» col i (ampicillin resistance) as well as an j:. coli origin of DNA 
replication* These sequences are derivable from the plasmid pML-1. 
The SV40 origin is derivable from a 342 base pair PvuII-Hindlll 
fragment encompassing this region (35* 36) (both ends being 
convertable to EcoRI ends). These sequences, in addition to 

30 comprising the viral origin of DNA replication, encode the promoter 
for both the early and late transcriptional unit. The orientation 
of the SV40 origin region is such that the promoter for the late 
transcriptional unit is positioned proximal to the gene encoding 
interferon. 
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Brief Description of the Drawings 

Figure A is a schematic diagram of urokinase polypeptides. Low 
molecular weight urokinase begins at amino acid 136 and ends at 
amino acid 412. High molecular weight urokinase starts at amino 
acid 1 and terminates at amino acid 412. Conversion of the one 
chain form of both high arid low molecular weight urokinase to the 
bioactive two-chain form occurs by proteolytic cleavage between 
amino acid 158 and 159. Pre-urokinase begins at amino acid-20. The 
depicted conformational positioning of the disulfide bonds is based 
on analogy with other serine proteases. 

Figures 1A to 1C depict the nucleotide sequence and restriction 
endemic! ease map of the plasmid pD2 cDNA insert for low molecular 
weight 33000 dalton urokinase bioactive protein. The nucleotide 
portion of the synthetic deoxyoligonucleotide CB6B probe is 
underlined. 

Figures 2A,B depict the deduced amino acid sequence of the cDNA 
sequence of Figure 1, the amino acids of the cDNA insert portion 
being numbered 1 to 279. 

Figures 3A to 3C depict the nucleotide sequence and restriction 
endonuclease map of the cDNA for full length human urokinase 
protein. The CB6B probe is likewise underlined. 

Figures 4A,B depict the deduced amino acid sequence for full 
length urokinase from the cDNA sequence of Figure 3. 

Figure 5 illustrates the construction of plasmid pUK33trpLE^ 
for expression of long fusion -33000 dalton protein. 

Figure 6 illustrates the construction of plasmid pUK33trpLEg 
for expression of short fusion -33000 dalton protein. 
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Figure 7 shows the construction of the plasmid for direct 
expression of the 33000 dalton protein. 

Figure 8 illustrates another construction of a plasrnid for 
5 direct expression of the 33K dalton protein. 

Figures 9 and 10 illustrate the construction of plasmids for 
the direct expression of 54K urokinse and a precursor form of 54K 
urokinase. 

10 i 

Figure 11 shows the construction of a plasmid (p-pEH3-Ba!14 
prellK54) for the expression of 54K urokinase in eukaryotic cells. 

Figure 12 illustrates the time dependent activation of, and the 
15 requirement for, plasminogen in a plasmin assay by urokinase 
produced as described herein. 

Figure 13 illustrates the plasminogen activating activity of the 
urokinase extracts hereof and the inhibition of that activity by 
20 antibodies raised against natural urokinase. 

Detailed Description 

A. Source of Urokinase mRNA 

Detroit 562 (human pharangeal carcinoma) cells (38}(ATCC No, CCL 
25 138) were cultured to confluency in Eagle's minimal essential medium 
(39) supplemented to contain 3 percent sodium bicarbonate (pH 7.5), 
1 percent L-'glut amine ( Irvine) , 10 percent fetal bovine serum, 1 
percent sodium pyruvate (Irvine), 1 percent non-essential amino 
acids (Irvine), 2.4 percent HEPES (pH 7.5), 50 vg/ml Garamycin, and 
30 incubated at 37*C in a 5 percent C0 2 atmosphere. Confluent cells 
were harvested by centrifugation after treatment with 0^25 percent 
trypsin for 15 minutes at 37°C. 

35 
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B. Messenger RNA Isolation 

Total RNA from Detroit 562 cells was extracted essentially as 
reported by Lynch et efL (40)* Cells were pelleted by 
centrifugation and approximately one gram of cells was then 
5 resuspended in 10 ml of 10 mM NaCl, 10 mM Tris'HCl (pH 7,4), 1.5 

wM MgClg. Cells were lysed by the addition of non-ionic detergent 
NP-4Q to a final concentration of 1 percent* Nuclei were removed by 
centrifugation and the RNA was further purified by two successive 
phenol (redistil led) /chloroform: isoamyl alcohol extractions at 

10 4*C. The aqueous phase was made 0.2 H NaCl and total RNA was 

precipitated by addition of two volumes of 100 percent ethanol and 
overnight storage at -20*C. Following centrifugation polyA mRNA. was 
purified from total RNA by oligo-dT cellulose chromatography (41)* 
Yields from 1 gram of cells were typically 10-15 mg of total RNA and 

15 ~2 percent of that was polyA mRNA. 

C Size Fractionation of polyA mRNA 
Size fractionation of 200 ug polyA mRNA was achieved by 
electrophoresis through an acid-urea gel composed of 1.75 percent 

20 agarose, 25 mM sodium citrate (pH 3.8) and 6 M urea (40,42). 

Electrophoresis was performed for 7 hours at 25 mA and 4°C* The gel 
was then fractionated manually with a razor blade. Individual 
slices of the gel were melted at 70*C, diluted into 12 mis 10 mM 
NaCl, 10 mM Tris'HCl (pH 7.4), 1.5 mM MgC1 2 , 0.1 percent SDS and 

25 extracted twice with water saturated, redistilled phenol and once 
with chloroform. Fractions were then ethanol precipitated and 
subsequently translated in vitro (43) in order to determine the 
affected size fractionation and integrity of the polyA mRNA. 

30 D* Preparation of 01igo~dT Primed Colony Library Containing 

Urokinase DNA Sequences 
Poly A mRNA was si2e-fractionated on acid urea gels. mRNA 
fractions greater than 12S were pooled and used as template for 
oligo-dT primed preparation of double stranded cDNA by standard 
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procedures (44,45). The cDNA was size fractionated on 6 percent 
polyacryl amide gel electrophoresis and 132 ng of ds cDHA greater 
than 350 basepairs in length was obtained by electroelution, A 30 
ng portion of ds cDNA was extended with deoxy C residues using 
5 terminal deoxynucleotidyl transferase (46) and annealed with 200 ng 
of the plasmid pBR322 (37) which had been similarly tailed with 
deoxy G residues at Pst I site (46), Each annealed mixture was then 
transformed into coli K12 strain 294 (ATCC No. 31446). 
Approximately 10000 ampicillin sensitive, tetracycline resistant 
10 transfdrmants were obtained. 

E. Preparation of Synthetic 0NA Oligomers for Use as Urokinase 
Screening Probes 

Eight synthetic DMA oligomers 14 bases long were designed 
15 complementary to mRNA based on the Met~Tyr-Asn-Asp-Pro amino acid 
sequence of a urokinase cyanogen bromide polypeptide fragment 
designated C86* These eight deoxyoligonucleotides were synthesized 
by the phosphotri ester method (17) in the following pools of two: 
(CB6A) 5' GGGTCGTTA/ GTACAT 3', (CB6B) 5' GGATCGTTA/ GTACAT 3'» (C86C) 
20 5 1 GGGTCATTA/ GTACAT 3*, (CB6D) 5' GGATCATTA/ GTACAT 3'- Each pool of 
two oligomers v/as then radioactively phosphorylated as follows: 250 
ng of deoxyoligonucleotide were combined in 25 \l1 of 60 mM 
Tris'HCl (pH 8), 10 mM MgClg, 15 mM beta-mercaptoethanol, and 
100 uCi (y- 32 P) ATP (Amersham, 5000 Ci/mMole). 5 units of T4 
25 polynucleotide kinase were added and the reaction was allowed to 

proceed at 37*C for 30 minutes and terminated by addition of EDTA to 
20 nM. 

F. Screening of Oligo dT Primed Colony Library for Urokinase 
30 Sequences 

10000 colonies were individually inoculated into wells of 
microtitre dishes containing LB (48) + 5 yg/ml tetracycline and 
stored at ~20°C after addition of DMSO to 7 percent. Individual 
colonies from this library were transferred to Schleicher and 
35 Schuell BA85/20 nitrocellulose filters and grown on agar plates 
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containing LB * 5 ug/ml tetracycline. After *~10 hours growth at 
37*C the colony filters were transferred to agar plates containing 
LB + 5 pg/ml tetracycline and 12,5 pg/ml chloramphenicol and 
re incubated overnight at 37*C. The DNA from each colony was then 
5 denatured and fixed to the filter by a modification of the 

Grunstein-Hogness procedure (49). Each filter was floated for 3 
minutes on 0*5 N NaOH, 1.5 M NaCl to lyse the colonies and denature 
the DNA then neutralized by floating for 15 1 on 3 H NaCl, 0,5 M 
Tris'HCl (pH 7.5). The filters were then floated for an 

10 additional 15 minutes on 2XSSC* and subsequently baked for 2 hours 
in an 80 *C vacuum oven. The filters were prehybridized for ~2 hours 
at room temperature in 0.9 M NaCl, IX Denhardts, lOOjrM Tris'HCl 
(pH 7.5} , 5 nM Na-EDTA, 1 mM ATP, 1 M sodium phosphate (dibasic), 
1 mM sodium pyrophosphate, 0.5 percent NP-40, and 200 pg/ml E. coli 

15 t-RNA, and hybridized in the same solution overnight essentially as 
described by Wallace et art. (50) using -40x10 cpm of each kinased 
CB6 deoxyoligonucleotide pool of 2. After extensive washing at 37*C 
in 6X SSC, 0.1 percent SDS, the filters were exposed to Kodak XR-5 
X-ray film with DuPont Lightning-Plus intensifying screens for 16-24 

20 hours at -80*C. Two colonies indicated hybridization with the 

mixture of eight probes: UK513dT69D2 (pD2) and UK523dT73Dl2 (pD12). 

G. Characterization of pD2 and pD12 Plasmid DNA 
Plasmid DNA isolated from E. coli colony UlC513dT69D2 and 

25 UK513dT73D12 by a rapid miniscreen method (51) was subjected to PstI 
restriction endonuclease analysis. This analysis strongly suggested 
that pD2 and pD12 are identical. Each plasmid DNA has 3 Pst I 
restriction fragments that comigrate when electrophoresed through a 
6 percent po ly aery 1 amide gel. The complete nucleotide sequence of 

30 the plasmid pD2 cDNA insert was determined by the dideoxynucleotide 
chain termination method (52) after subcloning the PstI restriction 
fragments into the M13 vector mp7 (53). Figure 1 presents the 
nucleotide sequence and Figure 2 presents the translated nucleotide 
sequence of the cDNA insert fragment of pD2. The entire coding 

35 region of low molecular weight (33K) urokinase was isolated on this 
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one large fragment of p02. The nucleotide sequence of the CB6B (5 1 
GGATCGTTA/ GTACAT) deoxyol igonucleotide includes nucleotides 466 
through 479 according to this map. A typical serine protease active 
site (gly asp ser gly gly pro) is present between amino acids 222 
5 and 227. The coding region consists of 838 basepairs or 279 amino 
acids of the carboxy terminal portion of high molecular weight (54K) 
urokinase* The stop codon UGA at amino acid position 280 begins ca. 
935 nucleotides of 3 1 untranslated sequence up to the poly A 
sequence. Because only 31413 daltons of full length urokinase v/ere 
10 encoded by the cDNA insert of pD2 it was necessary to construct 

additional colony banks containing urokinase sequences in order to 
identify high molecular weight urokinase. 

H. Construction of Two Different Specifically Primed Colony 
15 Banks for Amino Terminal Extension of the Existing 

Urokinase Clone 

The first specifically primed cONA bank utilized a 45 basepair 
urokinase DNA restriction endonuclease fragment beginning with Hael l 
in position 225 and ending with AccI in position 270 (Figure 1) as a 

20 primer rather than oligo ^2-18* This fragment was heat 

denatured in the presence of 20 pg unfractionated Detroit 562 polyA 
ttfiNA and cDNA was prepared according to procedures referenced in 
Section D„ 11.5 ng of double-stranded cDNA greater than 200 bp were 
electroeluted from a 6 percent polyacryl amide gel, and used to 

25 generate approximately 6000 clones in JE* coli 294, 

A second specifically primed cDNA bank called UK89CB6 of about 
4000 colonies was constructed using a pool of 4 ug polyA mRNA 
acid-urea agarose gel fraction 8 and 4 pg polyA mRNA from fraction 9 
(Section C)* 250 ng of each CB6 deoxyoligo- nucleotide pool 

30 (Section E) were used as primer rather than oligo d1*i2-18* 

I. Screening of Full Length Colony Bank 

Full length cDNA containing colonies were transferred directly 
to nitrocellulose filters then grown at 37 *C. The colonies were 
35 lysed and the DNA was denatured and fixed to the filters as 
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described in Section F (49). A ^P-labelled DNA probe was 
prepared (54) from a 143 basepair Hinfl to Haell restriction 
endonuclease fragment from the cDNA insert of pD2 and hybridized 
(55) with the filter fixed full length cDNA. 8xl0 5 CPM of the 
5 probe was hybridized for 16 hours then washed as described (55) and 
exposed to X-ray film. Two colonies demonstrated strong 
hybridization: A3 and E9. 

J* Characterization of Full Length Urokinase cDNA's pA3 and pE9 
10 PstI restriction analysis of A3 plasmid DNA showed cDNA insert 

fragments of -360 bp and ~50 bp, and of E9 plasmid DNA, one fragment 
at ~340 bp, PstI EcoR I double restriction of each plasmid DNA 
revealed a common cDNA insert fragment of ~190 bp as predicted where 
each plasmid DNA encoded urokinase sequence information 5* to the 
15 Hae ll AccI primer fragments Plasmid pA3 has an additional Pst I 
EcoR I cDNA insert fragment of 185 bp and E9 a 160 bp additional 
fragment. The larger ~360 bp Pst I cDNA insert fragment of pA3 was 
subcloned into the M13 vector mp7 (53) and sequenced by the 
dldeoxynucleotide chain termination method (52). The urokinase 
20 coding sequence of pA3 is located from approximately position 405 to 
position 785 in the cDNA sequence for full length urokinase protine 
in Figure 3, 

K. Screening of Urokinase Colony Bank UK89CB6 
25 DNA from ~1900 UK89CB6 cDNA insert containing colonies was 

denatured and fixed to nitrocellulose filters as previously 

32 

described in Section F. A P-labelled DNA probe was prepared 
(54) from the 146 bp PstI Hinfl fragment of the cDNA insert fragment 
of pA3. 40xl0 6 CPm of this probe was then hybridized to the 
30 filter bound DNA of UK89CB6 colonies for 16 hours then washed as 
described (55) and exposed to X-ray film. Two colonies 
demonstrating positive hybridization were UK89CB6F1 (pFl) and 
UK89CB6H10 (pHIO). 
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L. Characterization of pFI and pHIO Urokinase cDN As 
PstI restriction of pFI demonstrates cDNA insert fragments of 
~450 bp and -125 bp, and pHIO shows one Pst I cDHA insert fragment of 
~500 bp. Pst I EcoR I double restriction of pFI indicates no EcoRI 

5 restriction site and has cDMA insert fragments identical to those of 
the PstI restriction alone- pHIO does demonstrate an EcoR I 
restriction site, producing cDMA insert fragments of ~375 bp and 
-110 bp. Thi s EcoR I site of pHIO is probably the same EcoR I site as 
noted at position 627 (Figure 3). The pFI cDNA insert does not 

10 contain this EcoRI restriction site. 

pFI, having a cDNA insert fragment longer than that of pHIO, was 
selected for sequencing. Both Pst I restriction fragments of the pFI 
cDNA insert were sequenced by M13 subcloning and dideoxy 
sequencing. The composite nucleotide sequence of UK cDNA inserts 

15 from pFI, pA3 and pD2 encoding the entire amino acid sequence of 
high molecular weight full length urokinase is shown in Figure 3„ 
The urokinase coding sequence of pFI is depicted from position 1 
approximately to position 570. The urokinase coding sequence of pD2 
is located from position 532 to position 2304 in Figure 3. 

20 The amino terminal serine at amino acid position one as 

determined by amino acid sequence analysis is shown in Figures A and 
4, The preceding 20 amino acids at the amino terminus beginning 
with met and ending with gly probably serves as a signal sequence 
for the secretion of the remaining 411 amino acids of high molecular 

25 weight urokinase. This putative signal sequence has features, such 
as size and hydrophobi city (56,57), in common with other 
characterized signal sequences. 

Trypsin cleavage sites rendering 33K two-chain low molecular 
weight urokinase from high molecular weight urokinase are as 

30 follows: lys at position 136 is the amino terminal amino acid of 

the short chain and ile at position 159 is the amino terminus of the 
long chain (Figures A, 4) . 

35 - 
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Expression of Low Molecular Weight Derivatives of Urokinase 
in E. coli 

. 1. Long trp LE fusion (Figure 5) 

A plasmid (pNCV, 58) was constructed which has the 

5 following properties: 1) the plasmid is a derivative of pBR322 and 

is present in about 20 copies per cell. 2) the plasmid makes its 

iL* coli host resistant to tetracycline. 3) the plasmid contains an 

inducible tryptophan promoter* which directs the synthesis of a 

protein consisting of a fusion between the trp leader peptide and 

10 the trp E structural gene (LE fusion gene)* 4) A unique PstI 

restriction site v/as constructed in the trp E gene, which can be 

used to clone PstI DMA fragments, fay converting the EcoRI site at 

the distal end of the LE gene in plasmid pS0M7&lA4 to a PstI site 

flanked by two EcoRI sites using a synthetic sequence: 

15 AATTCT6CAG 

GACGTCTTAA. 

The DNA fragment containing the trp promoter and LE gene was then 
introduced into plasmid pBR322 to give plasmid pNCV (47A) . 

The urokinase PstI fragment from nucleotide position 5* (the 
2Q Pst I 5*- cleavage site) to nucleotide position 1130 (Figure 1) was 
cloned into the Pst I site of pNCV in such a v/ay that a fusion 
protein is made upon induction of the trp promoter* The N-terminal 
part is trp LE and the C-terminal part is low molecular weight 
urokinase* 

25 With reference to Figure 5, 5 pg of plasmid pUK513dT69D2 (pD2) 

was digested with 20 units PstI and the 1125 bp cDNA insert fragment 
encoding low molecular weight urokinase was isolated by 6 percent 
po ly aery 1 amide gel electrophoresis. ~1 pg of this insert was 
electroeluted from the gel* phenol /chloroform eKtracted and ethanol 

3 q precipitated, 1 pg of the vector plasmid pNCV (58) was digested 

with 10 units Pst I and ethanol precipitated after phenol /chloroform 
extraction* ~100 ng of this 1125 bp fragment was combined with -100 
ng PstI digested pNCV in 20 pi of 20 nM Tris'HCl (pH 7.5), 10 nM 
MgCl 2 , 10 mM DTT, 2,5 mM ATP and 30 units of T4 DNA ligase* After 

35 overnight ligation at 14*C, one half of the mixture was transformed 
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in E m coli K12 strain 294. BamHI digestion of the DNA from twelve 
transformants showed 3 with the proper orientation. Expression of 
this plasrnid (pUK33trpLE L ) (Figure 5) in E. col 1 yielded a long 
trp LE fusion protein including 33000 urokinase. The 33000 
5 urokinase is activated by cleavage with a trypsin-like active enzyme 
between positions 3 and 4 and positions 26 and 27 (see Figure 2). 

2. Short trp LE fusion (Figure 6) 

Plasrnid pINCV was constructed* It is similar to pNCV (see 

10 supra, ) in every respect except that the majority of the trp E gene 
is deleted. In this plasrnid the Bglli site was converted to a P stl 
site and the region between this new PstI site and the original PstI 
site was deleted. -100 ng of pINCV was digested with Pst I and 
Hind i I I then phenol/CHCl 3 extracted and ethanol precipitated. ~3 

15 ug of pUK33trpLE L (Figure 6) was digested to completion with 

Hindi II and partially digested with PstI to yield a Pst I Hindlll DNA 
fragment of 1150 bp which was purified by electroelution after 
electrophoresis on a 6 percent polyacrylamide geK The Pst I site 
within the structural UK gene was spared from digestion. All of the 

20 P$t l Hindlll digested pINCV was mixed with -50 ng of the 1150 bp 
Hindll I, partial PstI fragment of pUK33trpLE L and ligated 
overnight at 14 a C. This mixture was then transformed into E. coli 
K12 strain 294* BamH I digestion confirmed the proper construction 
of this plasrnid (pUK33trpLE s ) (Figure 6). Expression of this 

25 plasrnid in coli yielded a fusion protein from which 33,000 
urokinase is activated* as described supra . 

3. Direct Expression of 33K Urokinase (Figures 7 and 9) 
A urokinase DNA fragment beginning with nucleotide 16 

30 (Figure 1), was cloned into a pBR322 derivative resulting in a 
construction in which the trp promoter is positioned directly in 
front of this urokinase fragment encoding low molecular weight 
urokinase, j The plasrnid pLeIFAtrpl03 (Figure 7) is a derivative of 
the plasrnid pLeIFA25 (58} in which the Eco RI site distal to the 

35 LeIFA gene has been removed (59). 10 ng of pLeIFAtrp!03 (Figure 7) 
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was digested with 20 units EcoR I phenol/CHCl 3 extracted and 
ethanol precipitated. The EcoRI cohesive ends of the plasmid DNA 
molecules were extended to flush ends using 12 units of DNA 
Polymerase I in a 50 pi reaction containing 60 mM NaCl, 7 mM 
5 MgCLg, 7 mM Tris'HCl {pH 7.4) and 1 mM in each ribonucleotide 
triphosphate* The reaction v/as incubated at 37°C for 1 hour, 
extracted with phenol/CHClg and precipitated with ethanol. The 
DNA was then resuspended in 50 pi of 10 mM Tris'HCl (pH 8), 1 mM 
EOT A and treated with 500 units Bacterial Alkaline Phosphatase for 
10 30 minutes at 65 *C, twice extracted and ethanol precipitated. After 
digestion with PstI the mixture was electrophoresed on a 6 percent 
poly aery 1 amide gel and the -3900 bp vector fragment was 
electroeluted. 

The plasmid pUK33trpLE^ was transformed into JE. coli K12 
15 strain GM48 (deoxyadenosine methyl ase") in order that DNA purified 
from this£. col i strain could be digested with restriction 
endonticlease Bel t (60)* 4 pg of this DNA were treated for 1 hour at 
50*C with 6 units of Bell (in 75 mM KC1, 6 mM Tris'HCl (pH 7.4), 
10 mM MgCl 2> 1 m DTT), then made 50 mM NaCl and digested with 10 
20 units PstI. 6 percent gel electrophoresis was performed and the 914 
bp fragment was electroeluted. 

A 14 nucleotide DNA primer encoding the amino acid sequence met 
lys lys pro was synthesized by the phosphotri ester method (47) and 
has the following sequence: 

25 MetLysLysPro 

5 1 CTATGAAAA AGCC G 3' , 

500 ng of this primer were phosphorylated at the 5 1 end with 10 
units T4 DNA Kinase in a 20 pi reaction containing 0.5 mM ATP. The 

30 264 bp Pst I AccI cDNA insert fragment of pUK33trpLE L (grown in 

coli GM48) was isolated. ~500 ng of this fragment resuspended in 
10 pi of deionized water were mixed with the 20 pi of the 
phosphorylated primer, heated to 95*C for 3 minutes and quick frozen 
in a dry-ice ethanol bath. The denatured DNA solution was made 60 

35 m NaCl, 7 mM MgClg, 7 mM Tris'HCl (pH 7.4), 1 mM in each deoxy 
ribonucleotide triphosphate and 12 units DNA polymerase I large 
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fragment was added* After 2 hours incubation at 37°C this primer 
repair reaction was phenol/CHCl 3 extracted and ethanol 
precipitated and digested to completion with Bell at 50°C. The 
reaction mixture was then run on a 6 percent poly acryl amide gel and 
-50 ng of the 200 bp ami no-terminal blunt-end to BcU fragment was 
electroeluted. Subsequently* ~5Q ng of the blunt-Bel I primer-repair 
fragment, ~1QQ ng of the Bel l Pst I carboxy-terminal fragment and 
-100 ng of the ~390Q bp vector fragment were ligated overnight at 
14*C and transformed into E. coli 294, EcoR I digestion of a number 
of transformants indicated the proper construction and DNA sequence 
analysis proved the desired sequence through the initiation codon of 
this new plasmid, pUK33trpl03 (Figure 7) • In this construction, the 
N~terminal methionine is followed by two lysines which in turn are 
followed by the amino acid sequence 5 through 279 as depicted in 
Figure 2A, Expression of this plasmid in E. coli resulted in the 
synthesis of low molecular weight urokinase* This protein was 
activated with a trypsin-like active enzyme as described supra which 
serves to cleave the N-terminal lysine pair and cleaves between 
lysine in position 26 and isoleucine in position 27 (Figure 2A) . 

We found it desirable to construct a derivative plasmid of 
pUK33trpl03 that would confer tetracycline resistance to its host 
cell* Figure 8 depicts the following construction of 
pUK33trpl03Ap R ~Tc R * 5^g of pHGH207-l (See infra) was digested 
with Hpal and PvuII. The vector fragment was isolated and 
purified* 5pg of pUK33trpl03 was digested with Hpal and BamHl, 
electrophoresed on 6 percent poly acryl amide and the 836 bp DMA 
fragment was purified. A second 5ug aliquot of pUK33trpl03 was 
digested with BamHl and PvuII for isolation and purification of the 
119 bp DNA fragment. Equal molar amounts of each of these three DNA 
fragments were ligated overnight at 14 °C and used to transform 
JL* co1 * 294 - Restriction endonuclease analysis of plasmid DNA from 
several ampicillin resistant transformants verified the proper 
construction of pl)K33trpl03Ap and the reversal in orientation of 
the trp promoter/UK33 encoding DNA. 
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~5yg pBR322 DNA was digested with EcoRl and the cohesive ends 
were filled in with Klenow Poll. After Pstl digestion, the large 
vector fragment containing the DNA that encodes tetracycline 
resistance, the origin of replication, and a portion of the 
5 ampicillin resistance gene was isolated and purified. ~5ng of 
pUK33trpl03 Ap R was digested with BamHl and Pstl. The DNA 
fragment encoding the remaining portion of the ampicillin resistance 
gene, the trp promoter and most of low molecular weight urokinase 
was purified. Approximately equal molar amounts of these two DNA 
10 fragments and the 119 bp BamHl-PvuII DNA fragment from 

pUK33trpl03Ap R was ligated overnight at 14 °C to complete the 
construction of pUK33trpl03Ap R Tc R * This plasmid was employed in 
the construction of a plasmid designed to express high molecular 
weight full length urokinase (see Section N and Figure 9). 

15 

N» Expression of High Molecular Weight Derivatives of Urokinase 
1. Direct Expression of 54K Urokinase 
Figure 10 illustrates construct for full length urokinase. 
Plasmid pHGH207-l, having a single trp~promoter» was obtained by 

20 removal of the double lac-promoter from pHGH 207 that has a double 
lac-promoter followed by a single trp-promoter. This was done as 
follows: The trp-promoter 310bp DNA fragment was obtained from pFIF 
trp 69 (20) by digestion with EcoRI* This fragment was inserted 
into pHGH 107 (44) that had been opened with EcoRI. Thus, a plasmid 

25 was obtained (pHGH 207) that has a double lac promoter followed by 
the trp-promoter, flanked by EcoRI sites. The thus obtained pHGH 
207 was digested with BamHI; this was partially digested with EcoRI 
and the largest fragment was isolated. This fragment therefore has 
the entire trp-promoter. From pBR322 the largest EcoRI-BamHI 

30 fragment was isolated. Both fragments were ligated and the mixture 
was used to transform E. coli 294. Tet r > Amp r colonies were 
isolated and most of them had the plasmid with the structure as 
shown for pHGH207-l. Plasmid pHGH207-l is thus a derivative of the 
plasmid pHGH107 (44) and has the following properties: 1) the human 

35 growth hormone gene is flanked by the tryptophan promoter rather 
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than the lac promoter as with pHGHl07> 2) the plasmid confers 
ampicillin and tetracycline resistance when expressed in _E. coli . 
(See also 47A). 

20 pg of pHGH207-l was partially digested with EcoRI and totally 

digested with Bg II I. Purification of the large vector fragment was 

achieved by 5 percent polyacryl amide gel electrophoresis, 

electroelution, phenol/CHCl^ extraction and ethanol 

precipitation, 14 pg of pFl was digested with Bglll and Tacjl and 

the 236 bp DMA fragment was isolated and purified from a 6 percent 

polyacryl amide gel. The following complementary DNA fragments were 

synthesized by the phosphotri ester method (47): 

MetSerAs nG 1 uLeuHi sG 1 nV alPro 
5* AATTATGAGCAATGAATTACATCAA6TTCCAT 

T ACTC GTTACTT AATGTAGTTC A AGGTAGC 5* 

As indicated, the amino acid sequence Met Ser Asn Glu Leu His Gin 
Val Pro encodes the initiation codon, ATG, and the eight 
ami no-terminal amino acids of high molecular weight urokinase. 50 
ng of each synthetic DNA fragment were phosphorylated and the 
fragments were mixed, heated to 65*C for 1 minute and allowed to 
cool at room temperature for 5 minutes* 10 ng of the phosphorylated 
and mixed synthetic DNA fragments were combined with ~2Q0 ng of the 
partial EcoRI, Bglll pHGH207-l vector fragment and -50 ng of the 236 
bp Bgl ll TagI ONA fragment, ligated overnight at 14 *C and 
transformed into JE. coli 294- Individual plasmid DMAs from 24 
ampicillin resistant colonies were digested with EcoR I and Bgll l and 
one plasmid (plntl) demonstrating the proper construction was 
selected for DNA sequence analysis. This analysis verified the 
correct DNA sequence through the ATG initiation codon and the 
ami no-terminal portion of high molecular weight urokinase, 

4 pg of pNCV (Section M) was digested with Bglll and Clal and 
the large vector fragment was isolated and purified from a 5 percent 
polyacryl amide gel. 30 vg of pFl was digested with Pst I and Bglll 
and electrophoresed through a 6 percent polyacryl amide gel. The 44 
bp DNA fragment was electroeluted, phenol /CHCU extracted and 
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ethanol precipitated. 4 u g of pA3 DNA were digested with Pstl and 
TaqI and the 192 bp DNA fragment was purified from a 6 percent 
polyacryl amide gel. ~100 ng of the B_g1.II Clal vector DNA fragment, 
-50 ng of the 192 bp fragment and ~50 ng of the 44 bp fragment were 
5 combined and ligated overnight at 14*C then transformed into _E„ coli 
294 * Bgl ll Cla l double digestion of plasmid DNA from several 
tetracycline resistant transf ormants demonstrated the correct 
construction of this new plasmid named plnt2. 

5yg of pUK33trpl03Ap R Tc R grown in E. coli GM48 (ATCC No. 

10 39099, April 9, 1982) was digested with Bell and Sail for isolation 
and purification of the 1366 bp DNA fragment. The 108 bp EcoRl-Bcll' 
DNA fragment was isolated from the same plasmid. Approximately 
equal molar amounts of the EcoRl-Sall DNA fragment from 
pUK33trpl03Ap R Tc R and the EcoRl-Bcll DNA fragment also from 

15 pUK33trpl03Ap R Tc R were ligated overnight at 14*C to yield plnt3. 
5 ng of plnt3 DMA was digested with Bglll and Sail, 
electrophoresed through 6 percent polyacryl amide and the 1701 bp 
fragment containing the carboxy terminal portion of full length 
urokinase and the amino terminal portion of the tetracycline 

20 resistance gene was purified. ~5 jig of p intermediate 1 DNA was 
digested with Bgll l and Sail, electrophoresed through 6 percent 
polyacryl amide and the large vector fragment containing the 
carboxy-terminal portion of the tetracycline resistance gene* the 
origin of replication, the ampicillin resistance gene, the trp 

25 promoter, the initiation codon ATG, and the ami no-terminal portion 
of full length urokinase was purified. -100 ng of the vector 
fragment and ~100 ng of the 1701 bp fragment were combined, ligated 
overnight at 14*C and transformed into JE. coli 294. A plasmid from 
a tetracycline resistant and ampicillin resistant colony 

30 demonstrating the correct PvuI I restriction endonuclease pattern was 
identified and confirmed the construction of full length urokinase 
downstream from the trp promoter. This is plasmid pUK54trp207-4, 
Full length urokinase is produced by expression of this plasmid in 
1- co1i 294. 

35 
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2. Direct Expression of Pre-UK 54K in E. coli (Figure 10) 
The following scheme was used to construct a plasmid for 
direct expression of preUK54. Two complementary DNA fragments were 
synthesized by the phosphotri ester method (47) encoding the amino 
acid sequence initiation codon ATG followed by the first three 
N-terminal amino acids of the urokinase presequence Arg Ala Leu as 
shown below 

EcoRl MetArgAlaLeu Bgll 
5 1 AATTATGCGTGCCCTGC 
TACGCACGGG5* 



EcoRl and Bgll restriction endonuclease cleavage sites flank this 
portion of the presequence. 50ng of each synthetic DNA fragment was 
phosphorylated. The phosphorylated fragments were mixed s heated to 
65*C for 1 minute and allowed to cool to room temperature. 5ug of 
pFl DNA was digested with Bgll and Bgl2 and the 310bp UK DNA 
fragment was isolated and purified. -50ng of the 310bp UK DBA 
fragment, ~15Gng of the partial EcoRl, Bglll vector DNA fragment 
from pHGH207-l (see Section N) and lOng of the phosphorylated and 
2Q mixed synthetic DNA fragments was ligated overnight at 14 *C and 
transformed into E, coli 294» Individual plasmid DNA's isolated 
from several ampicillin resistant transf ormants were analyzed to 
confirm the proper construction and nucleotide sequence of the 
presequence of high molecular weight urokinase* 
25 One correct plasmid was named plnt4. 5ug of plnt4 DNA was 

digested with Bglll and EcoRv for isolation and purificaton of the 
vector DNA fragment containing the N-terminal nucleotides of 
pre-UK54» the trp promoter, ampicillin resistance genes, origin of 
replication and a portion of the tetracycline resistance encoding 
30 DNA. 5pg of pUK54trp207-l DNA was digested with Bglll and EcoRv. 

The BglllEcoRv DNA fragment encoding the remainder of UK54 and the 
tetracycline resistance encoding DNA was isolated, purified and 
ligated with the Bglll EcoRv vector DNA fragment to complete the 
construction of the plasmid p-preUK54trp207~U 

35 
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0. Direct Expression (Pre-) Urokinase in Tissue Culture 

Figure 11 depicts the introductin of the gene encoding 
preurokinase into the eukaryotic expression vector p342E (62) 
capable of replication and expression of preurokinase in permissive 

5 monkey cells* IQug of p342E DNA was digested with XBA1 and ^100 
base pairs were removed in each direction using Bal31 nuclease 
(fragment i). lOOng of Hindlll linker 5 1 CTCAAGCTTGAG synthesized 
by the phosphotri ester method (47) was phosphorylated, heated to 
65°C for 1 minute and allowed to cool to room temperature. The 

10 phosphorylated linker and fragment 1 were ligated overnight at 14*C 
and transformed into coli 294, Restriction endonuclease analysis 
of one transformant named pEH3-Ba!14 proved the introduction of a 
Hindlll restriction endonuclease site and the loss of the XBA1 
site. 5ug of pEH3-BAL14 DNA was digested with Hindlll and Hpal. 

15 The cohesive ends were extended to blunt ends using Klenow Poll. 

The DNA was treated with BAP and the vector fragment containing the 
SV-40 early promoter, ampicillin resistance genes and origin of 
replication was isolated and purified (fragment 2). 5pg of 
ppreUK54trp207-l was digested with Clal and XBal. The cohesive ends 

20 were extended with Klenow Poll and the DNA fragment encoding preUK54 
was isolated and purified (fragment 3). ~100ng of fragment 2 and 
-lOOng of fragment 3 were ligated overnight at 14°C and transformed 
* n JL» eft 11 . Z94 * Restriction endonuclease analysis of plasmid DNA 
named pEH3-BAL14 preliK54 from one transformant verified the correct 

25 construction. pEH3~BAL14 pre UK54 DNA was then used to transfect 
permissive monkey cells (62) for the expression preUK54 and the 
secretion of full length high molecular weight urokinase. 

P. Isolation and Characterization 

30 Urokinase containing residue was isolated from JE. coli . The 

residue was dissolved in 5M guanidine HC1, containing 50 mM Tris, pH 
8.0. The solution was diluted to 1M guanidine HC1, 5QnM Tris HC1, 
pH 9, at a protein concentration of 1 ng/ml. The solution was then 
brought to 2 mM reduced glutathione (GSH)» 0.2 mM oxidized 

35 glutathione (GSSG) and incubated overnight at room temperature. The 
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resulting solution containing refolded protein was then dialyzed 
into aqueous medium. The resulting solution contained urokinase 
which showed lOOPU/mg activity. 

Upon purification following conventional techniques* the protein 
is characterized showing the expected N-terminal sequence for both 
chains of the low molecular weight, bioactive material. C-terminal. 
analysis also shows the proper sequence for both chains* The 
protein migrates at a molecular weight of -30,000 daltons. It has a 
specific activity of -170,000 PU/mg 9225,000 IU/mg), assuming 1 
mg/ml has an 0D 280 of 1.3. 

Q. Assays for Detection of Expression of Urokinase 
1. Chromogenic Substrate 
a* Theory 

The assay is based on the proteolytic cleavage of a 
tripeptide from a chromophoric group. The rate of cleavage can be 
directly related to the specificity and to the concentration of the 
protease being tested. Urokinase cleaves the chromogenic substrate 
S2444 (purchased from Kabi Group Inc., Greenwich, CT) . By 
monitoring the generation of the chromophore, one can determine the 
amount of functional urokinase present in a sample* Urokinase is 
synthesized as an inactive precurser form, with activation occurring 
via the cleavage between residue 26 (lysine) and 27 (isoleucine) 
(numbering based on the protease clone, Figure 2A}» Some 
preparations of urokinase were found to have been autoactivated 
and/or were activated by E. coli proteases. To insure activation of 
urokinase, allowing detection by this chromogenic technique, 
treatment of the sample with low amounts of trypsin is required. 
Trypsin is a protease which can cleave the lysine-isoleucine bond 
required for urokinase activation. However, trypsin can also cleave 
the chromogenic substrate, and therefore must be eliminated from the 
assay. Soybean trypsin inhibitor (STI) is a protein which will 
inactivate trypsin while having no effect on urokinase. Therefore, 
the assay consists of trypsin activation of urokinase, STI 
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inhibition of the trypsin* and, finally, addition of the chromogenic 
substrate to measure the functional urokinase present. 

b. Procedure 

5 The assay is performed as follows: 0.2mL of 0.1M 

Tris, pH 8*0* 50pL of the sample to be assayed, and 5uL of trypsin 
(O.lmg/mL in 0.1M Tris, pH 8.0 plus Q.25M CaClg) were added to a 
test tube and the sample incubated for 10 minutes at 37*C. The 
trypsin was inactivated by the addition of 2jiL of lOmg/mL STI (in 

10 0.1M Tris, pH 8.0). Urokinase activity was determined by adding 
50uL of a ImM solution of S2444 (in water) and incubating the 
reaction for 10 minutes at 37°C. Acetic acid (50j*L) was added to 
stop the reaction, the solution centr if uged to remove a precipitate* 
and the absorbance at 405nm was determined. The actual amount of 

15 urokinase can be calculated by comparison of a sample with the 
reading obtained by performing the assay with dilutions of a 
standard solution of a known amount of urokinase (obtained from 
Calbiochem* San Diego, CA) * 

20 2* Direct Ass^y of Plasmin Formation 

a* Theory 

A much more sensitive assay for urokinase can be 
obtained by monitoring the urokinase catalyzed conversion of 
plasminogen to plasmin. Plasmin is an enzyme for which there are 

25 chromogenic substrate assays based on the same principles as 

described in 1 above. The basis of the assay is the determination 
of the amount of plasmin formed following incubation of the 
urokinase containing solution with a solution of plasminogen. The 
greater the amount of urokinase, the greater the amount of plasmin 

30 formed, 

b. Procedure 

An aliquot of the sample is mixed with 0.10 ml of 0.7 
mgs/ml plasminogen (in 0.5M Tris'HCl, pH 7.4, containing .012 M 
35 NaCl) and the volume adjusted to 0.15 ml. The mixture is incubated 
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at 37°C for various times (as indicated), 0.35 ml of S2251 (1.0 mM 
solution in above buffer) is added and the reaction continued for 5 
minutes at 37°C. Acetic acid (25 uL) is added to terminate the 
reaction and absorbance at 405 nm is measured. Quantitation of the 
5 amount of activity is obtained by comparison with dilutions of a 
standard urokinase solution. 

3. Indirect Assay of Plasmin Formation 

a. Theory 

A sensitive assay for urokinase activity has been 
developed (61). The assay is based on determination of plasmin 
formation by measuring the extent of plasmin digestion of fibrin in 
an agar plate containing fibrin and plasminogen. Plasmin produces a 
clear lysis zone in the fibrin plate. The area of this lysis zone 
can be correlated to the amount of urokinase in the sample* 

b, Procedure 
Following the procedure of Granelli-Piperno and Reich 

(61), the plates were incubated one to three hours at 37*C and lysis 
20 zones measured* Quantitation was obtained by performing the assay 
with dilutions of a standard urokinase solution. 

R. Detection of Urokinase Activity 

1* Bacterial Growth and Urokinase Sample Preparaton 

25 A strain of _E. coli (W3110) was transformed using a plasmid 

(pUK33trpLEs) containing a urokinase fusion protein. This 
expression vehicle (short trpLE fusion) was described above. The 
cells were grown on minimal media overnight, to an 0**0. at 550nm of 
1*2* An additional 200mL of media was added* Indole acrylic acid, 

3° a compound which appears to enhance expression of the tryptophan 
operon controlled genes, was added to a concentration of 10 ug/mL. 
The cells were incubated 2 hours and harvested. The cells obtained 
from 400mL of media were suspended in water and guanidine was added 
to a concentration of 7M (final volume of 40 mL). The solution was 

35 incubated for 90 minutes at room temperature. Insoluble material 
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was removed by centrif ligation* The supernatant was dialyzed against 
0,01 M Tris'HCl, pH 7.5, containing 0.1 M NaCl for 4 hours* 
Insoluble material was removed by centrif ligation and the sample was 
dialyzed for 2.5 hours against 0.01 M Tris'HCl, pH 7.5. 
5 The sample was applied to a 3*9 x 9 cm DE-52 column, which 

had been equilibrated with 0-01 M Tris'HCl, pH 7.5* and the column 
washed with the same buffer and eluted with 0.01 M Tris*HCl, pH 
7-5, containing 0,15 M NaCl. The peak of activity was pooled and 
used for all further studies* 

10 

2. Activity Detection 

Figure 12 shows the results of the direct activation of 
plasminogen by fractionated JE. Coli extracts when assayed under 
conditions similar to that described in Section Q.2. supra . Ah 

-J5 activity is generated which is dependent on the presence of 
plasminogen. Therefore, the activity being monitored is a 
plasminogen dependent activity. The activity being measured also 
increases with time, indicating a time dependent, catalytic 
generation of plasmin. These properties are consistent with those 

20 of urokinase, i.e., a catalytic activation of plasminogen. Similar 
extraction conditions performed on E. coli which do not contain the 
urokinase plasmid do not activate plasminogen under these conditions. 

The extracts also were tested in the assays as described 
above in order to detect and quantitate urokinase activity. Figure 

25 13 shows the effect of various amounts of the bacterially derived 
fractions in the assay described in Section Q.2 with a 10 minute 
activation- k The values obtained are compared to those obtained 
using a standard urokinase solution (Plough Unitage determination). 
The extent of plasminogen activation is directly proportional to the 

30 amount of added cloned urokinase. Antibodies raised against 

purified natural urokinase are known to decrease the activity of 
natural urokinase. The effect of these antibodies when added to 
this assay at time 0 are also shown in Figure 13- A marked 
inhibition of the ji. coli derived material is observed. This proves 

35 that the activity observed in the E. coli derived material has the' 
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same antigenic sites as natural urokinase and therefore that it is 
indeed urokinase being microbial ly synthesized. 

Similar results for activity detection and antibody 
inhibition are observed using the fibrin plate assay (described in 
Section Q.3). These results are summarized in Table I. 



10 



15 



SAMPLE 

Urokinase 
standard 



Urokinase 
produced 
herei n 



TABLE I 

FIBRIN PLATE ASSAY OF UROKINASE PROTEIN 

Activity in presence 
Activity of urokinase antibody Percent 

( Plough Units/mL ) 1 (Plough Units/ml) 1 Inhibition 



112 
11 
1.1 



480 
224 



2.5 

0.45 

0 



1.12 
0 



98 
96 
100 



99 
100 



1 Standard curve obtained by addition of a known amount of 
20 urokinase standard to wells. Values obtained f or j:. coli 

fractions and antibody inhibition obtained by extrapolation from 
this standard curve. 

The standard urokinase activity was inhibited 96 percent or 
greater by the addition of urokinase antibodies raised against this 

25 

protein. Assay of the jE. coli derived extracts had significant 
urokinase activity. This activity was almost completely inhibited 
when antibodies against natural urokinase were added to the assay. 

The third assay (the chrornogenic substrate assay) also 
detected a urokinase-like activity in the £. coli extracts. Since 
it is the least sensitive of the assays, antibody inhibition studies 
could not be performed due to the large quantities of antibody 
required to see an inhibition. 

The above three assays were quantitated using the standard 
35 urokinase purchased from Calbiochem. The values obtained (Plough 
units per mL) were all of the same order of magnitude: 500 for 
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fibrin plate; 100 for plasminogen activation; and 350 for the 
chromogenic substrate (S2444). Variations occuring are doubtless 
due to the relatively impure nature of the material at the time of 
testing, 

5 

Pharmaceutical Compositions 

The compounds of the present invention can be formulated 
according to known methods to prepare pharmaceutical ly useful 
compositions, whereby the human urokinase product hereof is combined 

10 in admixture with a pharmaceutical^ acceptable carrier vehicle. 

Suitable vehicles and their formulation are described for example in 
Remington's Pharmaceutical Sciences by E.W. Martin, which is hereby 
incorporated by reference. Such compositions will contain an 
effective amount of the protein hereof together with a suitable 

15 amount of vehicle in order to prepare phannaceutically acceptable 
compositions suitable for effective administration to the host. 
A. Parenteral Administration 

The human urokinase hereof may be parenterally administered to 
subjects suffering from thromboembolic diseases or conditions, 

20 Dosage and dose rate may parallel those currently in use in clinical 
applications of other cardiovascular, thrombolytic agents, e.g., 
about 4400 IU/kg body weight as an intravenous priming dose followed 
by a continuous intravenous infusion at about 4400 IU/kg/hr. for 12 
hours, in patients suffering from pulmonary embolism. 

25 As one example of an appropriate dosage form for essentially 

homogeneous human urokinase in parenteral form applicable herein, a 
vial containing 250000 IU urokinase activity, 25 mg. mannitol and 
45 mg* sodium chloride, may be reconstituted with 5 ml sterile water 
for injection and admixed with a suitable volume of 0.9 percent 

3° Sodium Chloride Injection or 5 percent Dextrose Injection for 
intravenous administration. 

The human urokinase protein hereof has been defined by means of 
determined DNA gene and deductive amino acid sequencing* It will be 
understood that natural allelic variations exist and occur from 

35 individual to individual. These variations may be demonstrated by 
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(an) amino acid diff erence(s) in the overall sequence or by 
deletions, substitutions, insertions, inversions or additions of 
(an) amino acid(s) in said sequence. In addition, the potential 
exists in the use of recombinant DNA technology for the preparation 
5 of various human urokinase derivatives, variously modified by 

resultant single or multiple amino acid substitutions, deletions, 
additions or replacements, for example, by means of site directed 
mutagenesis of the underlying DNA* ATI such modifications and 
allelic variations resulting in derivatives of human urokinase are 

10 included within the ambit of this invention so long as the 
essential, characteristic human urokinase activity remains 
unaffected in kind. 

Notwithstanding that reference has been made to particular 
preferred embodiments, it will be further understood that the 

15 present invention is not to be construed as limited to such, rather 
to the lawful scope of the appended claims* 
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Claims: 



y l* A protein comprising the enzymatic portion of human urokinase, 
essentially free of other protein of human origin. 

5 

y 2. A protein comprising the enzymatic portion of human urokinase, 
produced by a recombinant host cell* 

3. A protein comprising the enzymatic portion of human urokinase 
10 unaccompanied by associated native glycosylate on. 

✓ 4. A protein according to Claims 1 or 3 containing a sequence of a . 
polypeptide extending from the N-terminus of the ord inar ily 
first amino acid of said protein, 

15 

^ 5. As a protein of Claim 1, bioactive full length human urokinase 
as a homogeneous polypeptide* 

j 6. As a protein of Claim 1, a bioactive human urokinase polypeptide 
20 in homogeneous f orm. 

SI* A DNA sequence comprising a sequence encoding a protein 
comprising the enzymatic portion of human urokinase. 

25 s 8. The DNA sequence according to Claim 7 operably linked with a DNA 
sequence capable of effecting expression thereof. 

s 9* A replicable expression vehicle capable, in a transformant 
microorganism or cell culture, of expressing a DNA sequence 
30 according to Claim 7* 

^ 10. A microorganism or cell culture transformed with the vehicle 
according to Claim 9. 

35 
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11. A microorganism according to Claim 10, obtained by transforming 
an J;, coli strain. 

12. A plasmid selected from the group consisting of pUK33trpLEL, 
pUK33trpLEs, pUL33trpl03, pUK54trp2Q7-l and pprdJK54trp2Q7-l. 

13. A microorganism or cell culture transformed with each of the 
plasmids according to Claim 12. 

14. A composition comprising a therapeutically effective amount of a 
protein comprising the enzymatic portion of human urokinase 
according to Claim 1 in admixture with a pharmaceutic ally 
acceptable carrier. 

15. The composition according to Claim 14 suitable for parenteral 
administration. 

16. A culture of cells genetically altered so as to direct the 
production of a protein comprising the enzymatic portion of 
human urokinase. 

17* The use of a protein according to Claim 1 for treatment of 

vascular diseases or coj^ijtions or for preparing pharmaceutical 
compositions useful for such treatment. 

18. A process which comprises expressing, a gene encoding a protein 
comprising the enzymatic portion of human urokinase in a 
microorganism or cell culture. 

19. A process for producing an expression vehicle according to Claim 
9 comprising constructing a first DNA sequence encoding a 
protein conprising the enzymatic portion of human urokinase and 
operably linking said first DNA sequence with a second DNA 
sequence capable of effecting expression of said first OKA 
sequence. 
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20. A process for preparing bioactive human urokinase protein from 
essentially bioinactive protein which comprises proteolytically 
cleaving the latter in vi tro . 

21. The process according to Claim 20 wherein the protein is cleaved 
at the N™terminus of low molecular weight protein. 

22. The process according to Claim 20 wherein the protein is cleaved 
at the Lys-Ile junction of the sequence Lys-Ile-Ile-Gly-Gly. 

23. The process according to Claim 20 wherein the product is 2-chain 
human urokinase otherwise corresponding to native product. 

24. The process according to Claim 20 wherein the cleaving is 
conducted with trypsin • 
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Pstl Mnll Mnll MboII 

1 CTGCAGG6SGS6GGGAGATGGAAAAAA6CCCTCCTCTCCTCCAGAAGAATTAAAATTTCA 
GACGTCCCCCCCCCCTCTACCTTTTTTCGGGAGGAGAGGAGGTCTTCTTAATTTTAAAGT 

Mnll 

BalIHaeIIIHinfIMnlISau96 EcoRI T rip 1A 

61 GTGTGGCCAAAAGACTCTGAGGCCCCGCTTTAAGATTATTGGGGGAGAATTCACCACCAT f 1 0 . 1 A 
CACACCGGTTTTCTGAGACTCCGGGGCGAAATTCTAATAACCCCCTCTTAAGTGGTGGTA 
Hael DdelHaelll HphI 

aql BstNI Fnu4HI Mhll Hpall HphI 

1 21 CGAGAACCAGCCCTGGTTTGCGGCCATCTACAGGAGGCACCGGGGGGGCTCTGTCACCTA 
GCTCTTGGTCGGGACCAAAC6CCGGTAGATGTCCTCCGTGGCCCCCCCGAGACAGTGGAT 
EcoRI I Hael I I Neil 

MnllBbv HphlDpnIHaell 
181 CGTGTGTGGAGGCAGCCTCATCAGCCCTTGCTGGGTGATCAGCGCCACACACTGCTTCAT 
GCACACACCTCCGTCGGAGTAGTCGGGAACGACCCACTAGTCGCGGTGTGTGACGAAGTA 
Fnu4HIMnlI BclISau3AHhaI 

Mnll AccI EcoRI I 

241 TGATTACCCAAAGAAGGAGGACTACATCGTCTACCTGGGTCGCTCAAGGCTTAACTCCAA 
ACTAATGGGTTTCTTCCTCCTGATGTAGCAGATGGACCCAGCGAGTTCCGAATTGAGGTT 

BstNI 

Mnll Mnll Haell 

301 CACGCAAGGGGAGATGAAGTTTGAGGTGGAAAACCTCATCCTACACAAGGACTACAGCGC 
STGCGTTCCCCTCTACTTCAAACTCCACCTTTTGGAGTAGGATGTGTTCCTGATGTCGCG 

Hhal 

HphI MboIIDpnl Mnll 

361 TGACACGCTTGCTCACCACAACGACATTGCCTTGCTGAAGATCCGTTCCAAGGAGGGCAG 
ACTGTGCGAACGAGTGGTGTTGCTGTAACGGAACGACTTCTAGGCAAGGTTCCTCCCGTC 

XhoIISau3A 

MstlBbv -Nc1I Mnll Dpnl 

421 GTGTGCGCAGCCATCCCGGACTATACAGACCATCTGCCTGCCCTCGATGTATAACGATCC 
CACACGCGTCGGTAGGGCCTGATATGTCTGGTAGACGGACGGGAG CTACATATTGCTAGG 
HhaIFnu4HI Hpall TaqI Sau3A 

Alul Dpnl EcoRI 
481 CCAGTTTGGCACAAGCTGTGAGATCACTGGCTTTGGAAAAGAGAATTCTACCGACTATCT 
GGTCAAACCGTGTTCGACACTCTAGTGACCGAAACCTTTTCTCTTAAGATGGCTGATAGA 
Sau3A 

HpaIIFnu4HIPvuII Alul Hpall Fnu 

541 CTATCCGGAGCAGCTGAAAATGACTGTTGTGAAGCTGATTTCCCACCGGGAGTGTCAGCA 
GATAGGCCTCGTCGACTTTTACTGACAACACTTCGACTAAAGGGTGGCCCTCACAGTCGT 
BbvAluI Nc1I Bbv 

4HI HphI Fnu4HI 

601 GCCCCACTACTACGGCTCTGAAGTCACCACCAAAATGCTATGTGCTGCTGACCCCCAATG 
CGGGGTGATGATGCCGAGACTTCAGTGGTGGTTTTACGATACACGACGACTGGGGGTTAC 

Bbv 

Hinfl BstNI Hinfl AvallMnll Mnll Hael I 

661 GAAAACAGATTCCTGCCAGGGAGACTCAGGGGGACCCCTCGTCTGTTCCCTCCAAGGCCG . 
CTTTTGTCTAAGGACGGTCCCTCTGAGTCCCCCTGGGGAGCAGACAAGGGAGGTTCCGGC 
EcoRI I Ddel Sau96 Fnu4 

I Alul Sau96 BstNI 

721 CATGACTTTGACTlaGAATTGTGAGCTCfi'iGCCuTGGATGrGCCCTGAAGGACAAGCCAGG 
GTAf TQAAACTGACCTTAAC AGTCGA( C CGGCACCT ACACGGGACTTCCTGTTCGGTCC 
Hi .<m EcoRI 
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AcylAccI Hinfl EcoRIIBamHISau3A MboII 

781 CGTCTACACGAGAGTCTCACACTTCTTACCCTGGATCC6CAGTCACACCAAGGAAGAGAA 
GCAGATGTGCTCTCAGAGTGTGAAGAATGGSACCTAGGCGTCAGTGTGGTTCCTTCTCTT 
Hgal BstNIXhoUDpnl 

HaelBstNIHaeinMnllMnllAvallEcoRIIMnU FIG 1R 

841 TGGCCTGGCCCTCTGAGGGTCCCCAGGGAGGAAACGGGCACCACCCGCTTTCTTGCTGGT ,u,,u 
ACCGGACCGGGAGACTCCCAGG6GTCCCTCCTTTGCCCGTGGTGGGCGAAAGAACGACCA 
HaeIIIEcoRIISau96DdeISau96BstNI 

Hinfl PvuII MboII MboH 

901 TGTCATTTTTGCAGTAGAGTCATCTCCATCAGCTGTAAGAAGAGACTGGGAAGATAGGCT 
ACAGTAAAAACGTCATCTCAGTAGAGGTAGTCGACATTCTTCTCTGACCCTTCTATCCGA 

Alul 

EcoRIIHphI Alul mi 

961 CTGCACAGATGGATTTGCCTGTGGCACCACCAGGGTGAACGACAATAGCTTTACCCTCAC 
GACGTGTCTACCTAAACGGACACCGTGGTGGTCCCACTTGCTGTTATCGAAATGGGAGTG 

BstNI 

HaeIHaeIIIEcoRIIFnu4HI MhllHaelHaelUBstNIMnHAvallHiiri 
1021 GGATAGGCCTGGGTGCTGGCTGCCCAGACCCTCTGGCCAGGATGGAGGGGTGGTCCTGAC 
CCTATCCGGACCCAC6ACCGACGGGTCTGGGAGACCGGTCCTACCTCCCCACCAGGACTG 
StuIBstNI Bbv BatlEcoRII Sau96 

fl PstI 
1081 TCAACATGTTACT6ACCAGCAACTTGTCTTTTTCTGGACTGAAGCCTGCAGGAGTTAAAA 
AGTTGTACAATGACTGGTCGTTGAACAGAAAAAGACCTGACTTCGGACGTCCTCAATTTT 

Taql Alul Hpall 

1141 AGGGCAGGGCATCTCCTGTGCATGGGCTCGAAGGGAGAGCCAGCTCCCCCGACCGGTGGG 
TCCCGTCCCGTAGAGGACACGTACCCGAGCTTCCCTCTCGGTCGA6GGGGCTGGCCACCC 

MnllHaelll XmnI Fnu4HI 

1201 CATTTGTGAGGCCCATGGTTGAGAAATGAATAATTTCCCAATTAGGAAGTGTAAGCAGCT 
GTAAACACTCCGGGTACCAACTCTTTACTTATTAAAGGGTTAATCCTTCACATTCGTCGA 
Sau96NcoI BbvAIu 

DdelMnll Mnll Alul Bbv 
1261 GA6GTCTCTTGAGGGAGCTTAGCCAATGTGGGAGCAGCGGTTTGGGGAGCAGAGACACTA 
CTCCAGAGAACTCCCTCGAATCGGTTACACCCTCGTCGCCAAACCCCTCGTCTCTGTGAT 
I Ddel Fnu4HI 

1321 ACGACTTCAGGGCAGGGCTCTGATATTCCATGAATGTATCAGGAAATATATATGTGTGTG 
TGCTGAAGTCCCGTCCCGAGACTATAAGGTACTTACATAGTCCTTTATATATACACACAC 

Alul 

1381 TATGTTTGCACACTTGTTGTGTGGGCTGTGAGTGTAAGT6TGAGTAAGAGCTGGTGTCTG 
ATACAAACGTGTGAACAACACACCCGACACTCACATTCACACTCATTCTCGACCACAGAC 

SfaNI 

1441 ATTGTTAAGTCTAAATATTTCCTTAAACTGTGTGGACTGTGATGCCACACAGAGTGGTCT 
TAAGAATTCAGATTTATAAAGGAATTTGACACACCTGACACTACGGTGTGTCTCACCAGA 

Mhll EcoRIISau96MnlIAvaII EcoR 

1501 TTCTGGAGA 3 f; TTATAGGTCACTCCTGGGGCCTCTTGGGTCCCCCACGTGACA6TGCCT6 
A/V \CCTCT': VrATCCAGTGAGGACCCCGGAGAACCCAGGGGGTGCACTGTCACGGAC 

BfitNIHaelll Sau96 BstN 
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II Rsal PstIFnu4HI Ddel 
15&1 GfiAATGTACTTATTCTGCAGCATGACCTGTGACCAGCACTGTCTCAGtTTCACTTTCACA Clfl IP 
CCTTACATGAATAAGACGTCGTACTGGACACTGGTCGTGACAGAGTCAAAGTGAAAGTGT • IU. 1 1# 
I Bbv 

BallHaelll Mill 
1621 TA6ATGTCCCTTTCTTGGCCAGTTATCCCTTCCTTTTAGCCTAGTTCATCCAATCCTCAC 
ATCTACAGGGAAAGAACCGGTCAATAGGGAAGGAAAATCGGATCAAGTAGGTTAGGAGTG 
Hael 

HphI Sau96 

1681 TGGGTGGGGT6AGGACCACTCCTTACACTGAATATTTATATTTCACTATTTTTATTTATA 
ACCCACCCCACTCCTGGTGAGGAATGTGACTTATAAATATAAAGTGATAAAAATAAATAT 
Mnl IAvall 

Bc1ISau3A 

1741 TrTTTGTAATTTTAAATAAAAGTGATCAATAAAATGTGATTTTTCTGA 
AAAAACATTAAAATTTATTTTCACTAGTTATTTTACACTAAAAAGACT 

Dpnl 
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FIG.2A 

1 10 
asp gly lys lys pro ser ser pro pro glu glu leu lys phe gin 
CTGCAGGGGGGGGGGA GAT GGA AAA AAG CCC TCC TCT CCT CCA GAA GAA TTA AAA TTT CAG 

20 30 
cys gly gin lys thr leu arg pro arg phe lys ile ile gly gly glu phe thr thr lie 
TGT GGC CAA AAG ACT CTG AGG CCC CGC TTT AAG ATT ATT GGG GGA GAA TTC ACC ACC ATC 

40 50 
glu asn gin pro trp phe ala ala ile tyr arg arg his arg gly gly ser val thr tyr 
GAG AAC CAG CCC TG6 TTT GCG GCC ATC TAC AGG AGG CAC CGG GGG GGC TCT GTC ACC TAC 

60 70 
val cys gly gly ser leu ile ser pro cys trp val ile ser ala thr his cys phe ile 
GTG TGT GGA GGC AGC CTC ATC AGC CCT TGC TGG GTG ATC AGC GCC ACA CAC TGC TTC ATT 

80 90 
asp tyr pro lys lys glu asp tyr ile val tyr leu gly arg ser arg leu asn ser asn 
GAT TAC CCA AAG AAG GAG GAC TAC ATC GTC TAC CTG GGT CGC TCA AGG CTT AAC TCC AAC 

100 110 
thr gin gly glu met lys phe glu val glu asn leu ile leu his lys asp tyr ser ala 
ACG CAA GGG GAG ATG AAG TTT GAG GTG GAA AAC CTC ATC CTA CAC AAG GAC TAC AGC GCT 

120 130 
asp thr leu ala his his asn asp ile ala leu leu lys ile arg ser lys glu gly arg 
GAC ACG CTT GCT CAC CAC AAC GAC ATT GCC TTG CTG AAG ATC CGT TCC AAG GAG GGC AGG 

140 150 
cys ala gin pro ser arg thr ile gin thr ile cys leu pro ser met tyr asn asp pro 
TGT GCG CAG CCA TCC CGG ACT ATA CAG ACC ATC TGC CTG CCC TCG ATG TAT AAC GAT CCC 

160 170 
gin phe gly thr ser cys glu ile thr gly phe gly lys glu asn ser thr asp tyr leu 
CAG TTT GGC ACA AGC TGT GAG ATC ACT GGC TTT GGA AAA GAG AAT TCT ACC GAC TAT CTC 

180 190 
tyr pro glu gin leu lys met thr val val lys leu ile ser his arg glu cys gin gin 
TAT CCG GAG CAG CTG AAA ATG ACT GTT GTG AAG CTG ATT TCC CAC CGG GAG TGT CAG CAG 

200 210 
pro his tyr tyr gly ser glu val thr thr lys met leu cys ala ala asp pro gin trp 
CCC CAC TAC TAC GGC TCT GAA GTC ACC ACC AAA ATG CTA TGT GCT GCT GAC CCC CAA TGG 

220 230 
lys thr asp ser cys gin gly asp ser gly gly pro leu val cys ser leu gin gly arg 
AAA ACA GAT TCC TGC CAG GGA GAC TCA GGG GGA CCC CTC GTC TGT TCC CTC CAA GGC CGC 

240 250 
met thr leu thr gly ile val ser trp gly arg gly cys ala leu lys asp lys pro gly 
ATG ACT TTG ACT GGA ATT GTG AGC TGG GGC CGT GGA TGT GCC CTG AAG GAC AAG CCA GGC 

260 270 
val tyr thr arg val ser his phe leu pro trp ile arg ser his thr lys glu glu asn 
GTC TAC ACG AGA GTC TCA CAC TTC TTA CCC TGG ATC CGC AGT CAC ACC AAG GAA GAG AAT 

279 

gly leu ala leu OP 

GGC CTG GCC CTC TGA GGGTCCCCAGGGAGGAAACGGGCAC CAC CCGCTTTCTTGCTGGTTGTCATTTTTGCAGTA 
GAGTCATCTCCATCAGCTGTAAGAAGAGACTGGGAAGATAGGCTCTGCACAGATGGATTTGCCTGTG^CACCACCAGG 
GTGAACGACAATAGCTTTACCCTCACGGATAGGCCTGGGTGCTGGCTGCCCAGACCCTC'O; XAL \ 3GAGGCGTG 
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FIG.2B 

6TCCTGACTCAACATGTTACTGACCAGCAACTTGTCTTTTTCTG6ACTGAAGCCTSCAGGA6TTAAAAAGG6CAG6GC 
ATCTCCTGTGCATGGGCTCGAAGGGAGAGCCAGCTCCCCCGACCGGTGGGCATTTGTGAGGCCCATGGTTGAGAAATG 
AATAATTTCCCAATTAGGAAGTGTAAGCAGCTGAGGTCTCTTGAGGGAGCTTAGCCAATGTGGGAGCAGCGGTTTGGG 
GAGCAGAGACACTAACGACTTCAGGGCAGGGCTCTGATATTCCAT6AATGTATCAGGAAATATATATGTGTGTGTATG 
TTTGCACACTTGTTGTGTGGGCTGTGAGTGTAAGTGTGAGTAAGAGCTGGTGTCTGATTGTTAAGTCTAAATATTTCC 
TTAAACTGTGTGGACTGT6ATGCCACACAGAGTGGTCTTTCTGGAGAGGTTATAGGTCACTCCTGGGGCCTCTTGGGT 
CCCCCACGTGACAGTGCCT6GGAATGTACTTATTCTGCAGCATGACCTGTGACCAGCACTGTCTCAGTTTCACTTTCA 
CATAGATGTCCCTTTCTTG6CCAGTTATCCCTTCCTTTTAGCCTAGTTCATCCAATCCTCACTGGGTGGGGTBAGGAC 
CACTCCTTACACTGAATATTTATATTTCACTATTTTTATTTATATTTTTGTAATTTTAAATAAAAGTGATCAATAAAA 
TGTGATTTTTCTGA 
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BbvHaell Thai Mnll Fnu4HIHaeIII MnlIFnu4HI Hael 
1 GTCCCCGCAGCGCCGTCGCGCCCTCCTGCCGCAGGCCACCGAGGCCGCCGCCGTCTASCG 
CAGGGGCGTCGCGGCAGCGCGGGAGGACGGCGTCCGGTGGCTCCGGCGGCGGCAGATCGC 

Fnu4HIHhaI Hhal HaelBgll HaeIIIFnu4HI Kha r-i 

r G.3A 

I Mnll Bgll .Thai HgalEcoRII 

61 CCCCGACCTCGCCACCATGAGAGCCCTGCT66CGCGCCTGCTTCTCTGCGTCCTGGTCGT 
GGGGCTGGAGCGGTGGTACTCTCGGGACGACCGCGCGGACGAAGAGACGCAGGACCAGCA 

I Hhal Hhal BstNI 

Hinfl Bbv TaqI 
1 21 GAGCGACTCCAAAGGCAGCAATGAACTTCATCAAGTTCCATCGAACTGTGACTGTCTA AA 
CTCGCTGAGGTTTCCGTCGTTACTTGAAGTAGTTCAAGGTAGCTTGACACTGACAGATTT 
Fnu4HI 

Mnll Rsal 
181 TGGAGGAACAT8TGT6TCCAACAAGTACTTCTCCAACATTCACTGGT6CAACTGCCCAAA 
ACCTCCTTGTACACACAGGTTGTTCATGAAGAGGTTGTAAGTGACCACGTTGACGGGTTT 

MnllBbv Mnll 
Z41 GAAATTCGGAGGGCAGCACTGTGAAATAGATAAGTCAAAAACCTGCTATGAGGGGAATGG 
CTTTAAGCCTCCCGTCGTGACACTTTATCTATTCAGTTTTTGGACGATACTCCCCTTACC 
Fnu4HI 

Mnll Hael Ncol HaeIIIHpaIISau96BstNI 

301 TCACTTTTACCGAGGAAAGGCCAGCACTGACACCATGGGCCGGCCCTGCCTGCCCTGGAA 
AGTGAAAATGGCTCCTTTCCGGTCGTGACTGTGGTACCCGGCCGGGACGGACGGGACCTT 
Haelll Sau96NaeIHaeIII EcoRII 

Rsal SfaNIXhoIISau3AMboIIA1uISa 
361 CTCTGCCACTGTCCTTCA6CAAACGTACCATGCCCACAGATCTGATGCTCTTCAGCTGGG 
GAGACGGTGACA6GAAGTCGTTTGCATGGTACGGGTGTCTAGACTACGAGAAGTCGACCC 

BglllOpnl PvuII Ha 

u96EcoRII PstI Hpall EcoRII 

421 CCTGGGGAAACATAATTACTGCAGGAACCCAGACAACCGGAGGCGACCCTGGTGCTATGT 
GGACCCCTTTGTATTAATGACGTCCTTGGGTCTGTTGGCCTCCGCTGGGACCACGATACA 
elllBstNI Mnll BstNI 

Sau96 Fnu4HI MstI 
481 GCAGGTGGGCCTAAAGCCGCTTGTCCAAGAGTGCATGGTGCATGACTGCGCAGATGGAAA 
CGTCCACCCGGATTTCGGCGAACAGGTTCTCACGTACCACGTACTGACGCGTCTACCTTT 
Hael II Hhal 

Mnll Mnll MjoII HaelHaelllHinflMnllSa 
541 AAAGCCCTCCTCTCCTCCAGAAGAATTAAAATTTCAGTGTGGCCAAAAGACTCTGAGGCC 
TTTCGGGAGGAGAGGAGGTCTTCTTAATTTTAAAGTCACACCGGTTTTCTGAGACTCCGG 
Mnll Ball DdelHael 

U96 EcoRI TaqI BstNI Fnu4H 

601 CC GCTTTAAGATTATTGGGGGAGAATTCACCAC CATCGAGAACCAGCCCTGGTTTGCGGC 
GGC6AAATTCTAATAACCCCCTCTTAAGTGGTGGTAGCTCTTGGTCGGGACCAAACGCCG 

II HphI EcoRII Hae 

I Mnll Hpall HphI MnllBbv 

661 CATCTACAGGAGGCACCGGGGGGGCTCTGTCACCTAGGTGT6TGGAGGCAGCCTCATCAG 
GTAGATGTCCTCCGTGGCCCCCCCGAGACAGTGGATGCACACACCTCCGTCGGA6TAGTC 

III Neil Fnu4HIMn1I 

HphlDpnlHaell Mnll 
721 CCCTTGCTGGGTGATCAGCGCCACACACTGCTTCATTGAT !"." XCAAAGAAGGAGGACTA 
GGGAACGACCCACTAGTCGCGGTGTG T GACGAA6TAACTA U ^GTTTCTTCCTCCTGAT 
BclISa .3AHhaI 
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AccI BstNI Mn 
781 CATCGTCTACCTGGGTCGCTCM6GCTTAACTCCAACACGCAAG6GGAGATGAA6TTTGA 

GTAGCAGATGGACCCAGCGAGTTCCGAATTGAGGTTGTGCGTTCCCCTCTACTTCAAACT 

*«n FIG3B 

II Mnll Haell HphI 

841 GGTGGAAAACCTCATCCTACACAAGGACTACAGCGCTGACACGCTTGCTCACCACAACGA 
CCACCTTTTGGAGTAGGATGTGTTCCTGATGTCGCGACTGTGCGAACGAGTGGTGTTGCT 

Hhal 

MboIISau3A Mnll MstIFnu4HI Neil 

901 CATTGCCTTGCTGAAGATCCGTTCCAAGGAGGGCAGGTGTGCGCAGCCATCCCGGACTAT 
GTAACGGAACGACTTCTAGGCAAGGTTCCTCCCGTCCACACGCGTCGGTAGGGCCTGATA 
XhoIIDpnl HhalBbv Hpall 

Mnll Sau3A Alul Dpn 

961 ACA6ACCATCTGCCTGCCCTCGATGTATAACGATCCCCAGTTTGGCACAAGCTGTGAGAT 
TGTCTGGTAGACGGACGGGAG CTACATATTGCTAGGG GTCAAACCGTGTTCGACACTCTA 
Taql Dpn I Sau 

I EcoRI HpaIIFnu4HIPvuII 

102 1 CACTGGCTTTGGAAAAGAGAATTCTACCGACTATCTCTATCCGGAGCAGCTGAAAATGAC 
GTGACCGAAACCTTTTCTCTTAAGATGGCTGATAGAGATAGGCCTCGTCGACTTTTACTG 
3A BbvAluI 

Alul Hpall Bbv H 

1081 TGTTGT6AAGCTGATTTCCCACCGG6AGTGTCAGCAGCCCCACTACTACGGCTCTGAAGT 
ACAACACTTCGACTAAAGGGTGGCCCTCACAGTCGTCGGGGTGATGATGCCGAGACTTCA 
Neil Fnu4HI 

phi Bbv Hinfl EcoRII Hi 

1141 CACCACCAAAATGCTATGT6CTGCTGACCCCCAATGGAAAACAGATTCCTGCCAGGGAGA 
GTGGTGGTTTTACGATACACGACGACTGGGG6TTACCTTTTGTCTAAGGACG6TCCCTCT 
Fnu4HI BstNI 

nflDdelAvallMnll Mnll Haelll A1 

1201 CTCAGGGGGACCCCTCGTCTGTTCCCTCCAAGGCCGCAT6ACTTTGACTGGAATTGTGAG 
GAGTCCCCCTGGGGAGCAGACAAGGGAGGTTCCGGCGTACTGAAACTGACCTTAACACTC 
Sau96 Fnu4HI 

ul Sau96 EcoRII Acyl AccI Hinfl 

1261 CTGGGGCCGTGGATGTGCCCTGAAGGACAAGCCAGGCGTCTACACGAGAGTCTCACACTT 
GACCCCGGCACCTACACGGGACTTCCTGTTCGGTCCGCAGATGTGCTCTCAGAGTGTGAA 
Haelll BstNI Hgal 

BstNIBamHIDpnl MboII HaelBstNIHaelllMnl IMnl IS 

1321 CTTACCCTGGATCCGCAGTCACACCAAGGAAGAGAATGGCCTGGCCCTCTGAGGGTCCCC 
6AATGGGACCTAGGCGTCAGTGTGGTTCCTTCTCTTACCGGACCGGGAGACTCCCAGGGG 
EcoRI IXhoIISau3A HaeIIIEcoRIISau96DdeIAv 

BstNIMnll Hinfl 
1381 AGGGAGGAAACGGGCAC C AC CCGCTTTCTTGCTGGTTGTCATTTTTGCAGTAGAGTCATC 
TCCCTCCTTTGCCCGTGGTGGGCGAAAGAACGACCAACAGTAAAAACGTCATCTCAGTAG 
EcoRII 

PvuII MboII MboII 
1441 TCCATCAGCTGTAAGAAGAGACTGGGAAGATAGGCTCTGCACAGATGGATTTGCCTGTGG 
AGGTAGTCGACATTCTTCTCTGACCCTTCTATCCGAGACGTGTCTACCTAAACGGACACC 
Alul 

EcoRIIHphI Alul Mnll HaeIHaeIIIEcoRIIFnu4 

1501 CACCACCAGGGTGAACC \t .AATASCTTTACCCTCACGGATAGGCCTGGGTGCTGGCTGCC 
GTGGTGGTCCCACTPtf ^"TATCGAAATGGGAGTGCCTATCCGGACCCACGACCGACGG 
BstKI St ul BstNI Bbv 
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HI MnTIHaelHaelllBstNIMnllAvallHinfl 
1561 CAGACCCTCT6GCCAGGATGGAGGGSTGBTCCTGACTCAACATGTTACTGACCAGCAACT 
GTCTGGGAGACCGGTCCTACCTCCCCACCAGGACTGAGTTGTACAATGACTGGTCGTTGA 
BallEcoRII Sau96 

FIG.3C 

1 621 TGTCTTTTTCTGGACTGAAGCCTGCAGGAGTTAAAAAGGGCAGGGCATCTCCTGTGCAT6 
ACAGAAAAAGACCTGACTTCGGACGTCCTCAATTTTTCCCGTCCCGTAGAGGACACGTAC 

TaqI Alul Hpall MnllHaelll 

1681 GGCTCGAAGGGAGAGCCAGCTCCCCCGACCGGTGGGCATTTGTGAGGCCCATGGTTGAGA 
CCGAGCTTCCCTCTCGGTCGAGGGGGCTGGCCACCCGTAAACACTCCGG6TACCAACTCT 

Sau96NcoI 

Xmnl BbvPvuIIDdel Mnll Alul 

1741 AATGAATAATTTCCCAATTAGGAAGTGTAAGCAGCTGAGGTCTCTTGAGGGAGCTTAGCC 
TTACTTATTAAAGGGTTAATCCTTCACATTCGTCGACTCCAGAGAACTCCCTCGAATC6G 

Fnu4HIA1uIMn1I Ddel 

Fnu4HI 

1801 AATGTGGGAGCAGCGGTTTGGGGAGCAGAGACACTAACGACTTCAGGGCAGGGCTCTGAT 
TTACACCCTCGTCGCCAAACCCCTCGTCTCTGTGATTGCTGAAGTCCCGTCCCGAGACTA 
Bbv 

1861 ATTCCATGAATGTATCAGGAAATATATATGTGTGTGTATGTTTGCACACTTGTTGTGTGG 
TAAGGTACTTACATAGTCCTTTATATATACACACACATACAAACGTGTGAACAACACACC 

Alul 

1921 GCTGTGAGTGTAAGTGTGAGTAAGAGCT66TGTCTGATTGTTAAGTCTAAATATTTCCTT 
CGACACTCACATTCACACTCATTCTCGACCACAGACTAACAATTCAGATTTATAAAGGAA 

SfaNI Mnll E 

1981 AAACTGTGTGGACTGTGATGCCACACAGAGTGGTCTTTCTGGAGAGGTTATAG6TCACTC 
TTTGACACACCTGACACTACGGTGTGTCTCACCAGAAAGACCTCTCCAATATCCAGTGAG 

B 

coRIISau96MnlISau96 EcoRII Rsal PstIFnu4HI 

2041 CTGGGGCCTCTTGGGTCCCCCACGTGACAGTGCCTGG6AATGTACTTATTCTGCAGCATG 
GACCCCGGAGAACCCAGG6G6TGCACTGTCACGGACCCTTACATGAATAAGACGTCGTAC 
stNIHaelll Avail BstNI Bbv 

Ddel KaelHaell 
2101 ACCTGTGACCAGCACTGTCTCAGTTTCACTTTCACATAGAT6TCCCTTTCTTGGCCAGTT 
TGGACACTGGTCGTGACAGAGTCAAAGTGAAAGTGTATCTACAGGGAAAGAACCGGTCAA 

Ball 

I Mnll HphI Sau96 

2161 ATCCCTTCCTTTTAGCCTAGTTCATCCAATCCTCACTGGGTGGGGTGAGGACCACTCCTT 
TAGGGAAGGAAAATCGGATCAAGTAGGTTAGGAGTGACCCACCCCACTCCTGGTGAG6AA 

MnllAvall 

Be 

2221 ACACTGAATATTTATATTTCACTATTTTTATTTATATTTTTGTAATTTTAMTAAAAGTG 
TGTGACTTATAAATATAAAGTGATAAAAATAAA7ATAAAAACATTAAAATTTATTTTCAC 

S 

1 lOpnl 

• ATCAATAAAATGTGATTTTTCTGA 
™ V'-iTTATTTTACACTAAAAAGACT 
A 
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GTCCCCGCAGCGCCGTCGCGCCCTCCTGCCGCAGGCCACCGAGGCCGCCGCCGTCTAGC6CCCCGACCTCGCGACG 
-20 -10 

met arg ala leu leu ala arg leu leu leu cys val leu val val ser asp ser lys gly 
ATG AGA GCC CTG CTG GCG CGC CTG CTT CTC TGC GTC CTG GTC GTG AGC GAC TCC AAA GGC 

1 10 20 

ser asn glu leu his gin val pro ser asn cys asp cys leu asn gly gly thr cys val 
AGC AAT GAA CTT CAT CAA GTT CCA TCG AAC TGT GAC TGT CTA AAT GGA GGA ACA TGT GTG 

30 40 
ser asn lys tyr phe ser asn ile his trp cys asn cys pro lys lys phe gly gly gin 
TCC AAC AAG TAG TTC TCC AAC ATT CAC TGG TGC AAC TGC CCA AAG AAA TTC GGA GGG CAG 

50 60 
his cys glu He asp lys ser lys thr cys tyr glu gly asn gly his phe tyr arg gly 
CAC TGT GAA ATA GAT AAG TCA AAA ACC TGC TAT GAG GGG AAT GGT CAC TTT TAC CGA GGA 

70 80 
lys ala ser thr asp thr met gly arg pro cys leu pro trp asn ser ala thr val leu 
AAG GCC AGC ACT GAC ACC ATG GGC CGG CCC TGC CTG CCC TGG AAC TCT GCC ACT GTC CTT 

90 100 
gin gin thr tyr his ala his arg ser asp ala leu gin leu gly leu gly lys his asn 
CAG CAA ACG TAC CAT GCC CAC AGA TCT GAT 6CT CTT CAG CTG GGC CTG GGG AAA CAT AAT 

110 120 
tyr cys arg asn pro asp asn arg arg arg pro trp cys tyr val gin val gly leu lys 
TAC TGC AGG AAC CCA GAC AAC CGG AGS CGA CCC TGG TGC TAT GTG CAG GTG GGC CTA AAG 

130 140 
pro leu val gin glu cys met val his asp cys ala asp gly lys lys pro ser ser pro 
CCG CTT GTC CAA GAG TGC ATG GTG CAT GAC TGC GCA GAT GGA AAA AAG CCC TCC TCT CCT 

150 160 
pro glu glu leu lys phe gin cys gly gin lys thr leu arg pro arg phe lys ile ile 
CCA GAA GAA TTA AAA TTT CAG TGT GGC CAA AAG ACT CTG AGG CCC CGC TTT AAG ATT ATT 

170 180 
gly gly glu phe thr thr lie glu asn gin pro trp phe ala ala ile tyr arg arg his 
GGG GGA GAA TTC ACC ACC ATC GAG AAC CAG CCC TGG TTT GCG GCC ATC TAC AGG AGG CAC 

190 200 
arg gly gly ser val thr tyr val cys gly gly ser leu ile ser pro cys trp val ile 
CGG GGG GGC TCT GTC ACC TAC GTG TGT GGA GGC AGC CTC ATC AGC CCT TGC TGG GTG ATC 

210 220 
ser ala thr his cys phe ile asp tyr pro lys lys glu asp tyr ile val tyr leu gly 
AGC GCC ACA CAC TGC TTC ATT GAT TAC CCA AAG AAG GAG GAC TAC ATC GTC TAC CTG GGT 

230 240 
arg ser arg leu asn ser asn thr gin gly glu met lys phe glu val glu asn leu ile 
CGC TCA AGG CTT AAC TCC AAC ACG CAA GGG GAG ATG AAG TTT GAG GTG GAA AAC CTC ATC 

250 260 
leu his lys asp tyr ser ala asp thr leu ala his his asn asp ile ala leu leu lys 
CTA CAC AAG GAC TAC AGC GCT GAC ACG CTT GCT CAC CAC AAC GAC ATT GCC TTG CTG AAG 

270 280 
ile arg ser lys glu gly arg cys ala gin pro ser arg thr ile gin thr ile cys leu 
ATC CGT TCC AAG GAG GGC AGG TGT GCG CAG CCA TCC CGG ACT ATA CAG ACC ATC TGC CTG 
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FIGAB 

290 300 
pro ser met tyr asn asp pro gin phe gly thr ser cys glu ile thr gly phe gly lys 
CCC TCG ATG TAT AAC GAT CCC CAG TTT GGC ACA AGC TGT GAG ATC ACT GGC TTT GGA AAA 

310 320 
glu asn ser thr asp tyr leu tyr pro glu gin leu lys met thr val val lys leu ile 
GAG AAT TCT ACC GAC TAT CTC TAT CCG GAG CAG CTG AAA ATG ACT GTT GTG AAG CTG ATT 

330 340 
ser his arg glu cys gin gin pro his tyr tyr gly ser glu val thr thr lys met leu 
TCC CAC CGG GAG TGT CAG CAG CCC CAC TAG TAC GGC TCT GAA GTC ACC ACC AAA ATG CTA 

350 360 
cys ala ala asp pro gin trp lys thr asp ser cys gin gly asp ser gly gly pro leu 
TGT GCT GCT GAC CCC CAA TGG AAA ACA GAT TCC TGC CAG GGA GAC TCA GGG GGA CCC CTC 

370 380 
val cys ser leu gin gly arg met thr leu thr gly ile val ser trp gly arg gly cys 
GTC TGT TCC CTC CAA GGC CGC ATG ACT TTG ACT GGA ATT GTG AGC TGG GGC CGT GGA TGT 

390 400 
ala leu lys asp lys pro gly val tyr thr arg val ser his phe leu pro trp ile arg 
GCC CTG AAG GAC AAG CCA GGC GTC TAC ACG AGA GTC TCA CAC TTC TTA CCC TGG ATC CGC 

410 411 

ser his thr lys glu glu asn gly leu ala leu OP 

AGT CAC ACC AAG GAA GAG AAT GGC CTG GCC CTC TGA GGGTCCCCAGGGAGGAAACGGGCACCACCC 
GCTTTCTTGCTGGTTGTCATTTTTGCAGTAGAGTCATCTCCATCAGCTGTAAGAAGAGACTGGGAAGATAGGCTCTGC 
ACAGATGGATTTGCCTGTGGCACCACCAGGGTGAACGACAATAGCTTTACCCTCACGGATAGGCCTGGGTGCTGGCTG 
CCCAGACCCTCTGGCCAGGATGGAGGGGT6GTCCTGACTCAACATGTTACTGACCAGCAACTTGTCTTTTTCTGGACT 
GAAGCCTGCAGGAGTTAAAAAGGGCAGGGCATCTCCTGTGCATGGGCTCGAAGGGAGAGCCAGCTCCCCCGACCGGTG 
GGCATTTGTGAGGCCCATGGTTGAGAMTGMTMTTTCCCAATTAGGAAGTGTAAGCAGCTGAGGTCTCTTGAGGGA 
GCTTAGCCAATGTGGGAGCAGCG6TTTGGGGAGCAGAGACACTAACGACTTCAGGGCAGGGCTCT6ATATTCCATGAA 
TGTATCA6GAAATATATAT6TGTGTGTATGTTTGCACACTTGTTGTGTGGGCTGTGAGTGTAAGTGTGAGTAAGAGCT 
GGTGTCTGA1TGTTAA6TCTAAATATTTCCTTAAACTGTGTGGACTGTGATGCCACACAGAGTGGTCTTTCTGGAGAG 
GTTATAG6TCACTCCTGGGGCCTCTTGGGTCCCCCACGTGACAGTGCCTGG6AATGTACTTATTCTGCAGCATGACCT 
GTGACCAGCACTGTCTCAGTTTCACTTTCACATAGATGTCCCTTTCTTGGCCAGTTATCCCTTCCTTTTAGCCTAGTT 
CATCCAATCCTCACTGGGTGGGGTGAGGACCACTCCTTACACTGAATATTTATATTTCACTATTTTTATTTATATTTT 
TGTAATTTTAAATAAAAGTGATCAATAAAATGTGATTTTTCTGA 
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